SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published 11 days ago • 74
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Paper • 2502.18449 • Published 28 days ago • 71
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction Paper • 2502.07316 • Published Feb 11 • 47
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 212
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated Feb 20 • 35
💻 Local SmolLMs Collection SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated Feb 20 • 51
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25, 2024 • 94
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations Paper • 2405.18392 • Published May 28, 2024 • 12
Leaderboards and benchmarks ✨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 91 items • Updated 25 days ago • 99
ZeroGPU Spaces Collection ZeroGPU Spaces made by the community • 17 items • Updated Jun 6, 2024 • 236