view article Article Train 400x faster Static Embedding Models with Sentence Transformers By tomaarsen • Jan 15 • 185
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • 17 days ago • 140
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining Paper • 2505.07608 • Published 25 days ago • 78
view article Article I trained a Language Model to schedule events with GRPO! By anakin87 • Apr 29 • 76
Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated Apr 10 • 87
view article Article You could have designed state of the art positional encoding By FL33TW00D-HF • Nov 25, 2024 • 288
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated 19 days ago • 148
👩💻 OlympicCoder Collection Reasoning datasets and models for competitive coding • 4 items • Updated 24 days ago • 17
olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated 18 days ago • 114
view article Article FastRTC: The Real-Time Communication Library for Python By freddyaboulton and 1 other • Feb 25 • 162
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Paper • 2502.06781 • Published Feb 10 • 61