view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 18 days ago • 578
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published 30 days ago • 63
ILSP Greek Evaluation Suite Collection A collection of test sets for evaluating base and chat LLMs (incl. VLMs) on Greek generation and understanding capabilities • 15 items • Updated Jun 18 • 3
Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design Paper • 2506.04734 • Published Jun 5 • 19