TheMrCodes
AI & ML interests
None yet
Organizations
None yet
AI Safety
Interesting Datasets
LM Research
- TinyLlama: An Open-Source Small Language Model
  Paper • 2401.02385 • Published • 95
- Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
  Paper • 2401.01335 • Published • 68
- Asynchronous Local-SGD Training for Language Modeling
  Paper • 2401.09135 • Published • 11
- Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
  Paper • 2404.07143 • Published • 110
Interesting for LLM Products
Tiny MMLM
Knowledge Graph
Cool Papers
Image Gen
Milestones
Read later list
Waiting for model weights
- DocLLM: A layout-aware generative language model for multimodal document understanding
  Paper • 2401.00908 • Published • 189
- Multilingual E5 Text Embeddings: A Technical Report
  Paper • 2402.05672 • Published • 23
- Accelerating High-Fidelity Waveform Generation via Adversarial Flow Matching Optimization
  Paper • 2408.08019 • Published • 11
Fundamental Research
Bio ML
Point Tracking Models