view article Article CodeAgents + Structure: A Better Way to Execute Actions By akseljoonas and 1 other • 10 days ago • 43
Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning Paper • 2505.15966 • Published 16 days ago • 51
view article Article Interactive Tools for machine learning, deep learning, and math By Suzana • 11 days ago • 40
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation Paper • 2505.14640 • Published 17 days ago • 14
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • 17 days ago • 140
view article Article Microsoft and Hugging Face expand collaboration By jeffboudier and 2 others • 19 days ago • 20
view article Article TinyAgents: A Minimal Experiment with Code Agents and MCP Tools By albertvillanova • 22 days ago • 29
view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others • 23 days ago • 112
view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • 26 days ago • 417
view article Article LeRobot Community Datasets: The “ImageNet” of Robotics — When and How? By danaaubakirova and 6 others • 27 days ago • 57
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control By danaaubakirova and 3 others • Feb 4 • 158
view article Article Gotchas in Tokenizer Behavior Every Developer Should Know By qgallouedec • Apr 18 • 37