Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published 3 days ago • 123
Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi Paper • 2504.06011 • Published Apr 8 • 1
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • 16 days ago • 136
view article Article Falcon-Arabic: A Breakthrough in Arabic Language Models By tiiuae and 7 others • 16 days ago • 30
view article Article Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance By tiiuae and 5 others • 16 days ago • 25
Falcon-H1 Collection Falcon-H1 Family of Hybrid-Head Language Models, including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained and instruction-tuned). • 37 items • Updated 16 days ago • 38
view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others • 22 days ago • 110
view article Article Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models. By tiiuae and 9 others • 22 days ago • 33
Falcon Edge series Collection A series of powerful, universal and fine-tunable small Language Models • 7 items • Updated 16 days ago • 22
view article Article Improving Hugging Face Model Access for Kaggle Users By roseberryv and 4 others • 23 days ago • 27
view article Article Blazingly fast whisper transcriptions with Inference Endpoints By mfuntowicz and 5 others • 24 days ago • 67
view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • 25 days ago • 414
view article Article LeRobot Community Datasets: The “ImageNet” of Robotics — When and How? By danaaubakirova and 6 others • 26 days ago • 56
U-MATH and μ-MATH - University-level math evaluation Collection Paper: A UNIVERSITY-LEVEL BENCHMARK FOR EVALUATING MATHEMATICAL SKILLS IN LLMS • 4 items • Updated Jan 14 • 17
view article Article Creating your custom Ghibli Text-to-Image model By atlasia and 3 others • May 1 • 16