Primus: A Pioneering Collection of Open-Source Datasets for Cybersecurity LLM Training Paper • 2502.11191 • Published 5 days ago • 2
Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 4 days ago • 86
Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub 10 days ago • 48
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 17 days ago • 187
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Paper • 2501.18512 • Published 22 days ago • 27
Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 21 days ago • 35
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated 9 days ago • 90
Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • 29 days ago • 63
Article How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents By Steveeeeeeen • 23 days ago • 16
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 12 items • Updated 1 day ago • 76