The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published 1 day ago • 22
view article Article Announcing the Common Pile and Comma v0.1 By common-pile • about 11 hours ago • 9
view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL By toslali-ibm and 5 others • 4 days ago • 37
view article Article Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H By Hcompany and 1 other • 3 days ago • 60
view article Article AutoThink: Adaptive Reasoning for Large Language Models By codelion • 10 days ago • 4
view article Article CodeAgents + Structure: A Better Way to Execute Actions By akseljoonas and 1 other • 10 days ago • 43
Step 1: Reproducing DeepSeek's Distilled Models Collection Code for training and evaluation: https://github.com/huggingface/open-r1 • 3 items • Updated 11 days ago • 2
view article Article Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance By tiiuae and 5 others • 17 days ago • 26
view article Article NVIDIA Cosmos Now Available On Hugging Face For Physical AI Reasoning By PranjaliJoshi and 1 other • 18 days ago • 24
view article Article TinyAgents: A Minimal Experiment with Code Agents and MCP Tools By albertvillanova • 22 days ago • 29
view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others • 23 days ago • 112
INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning Paper • 2505.07291 • Published 26 days ago • 12
Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning Paper • 2504.11354 • Published Apr 15 • 5
view article Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate By muellerzr and 3 others • Jun 13, 2024 • 54
view article Article Empowering Public Organizations: Preparing Your Data for the AI Era By evijit and 1 other • Apr 10 • 15