Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published 3 days ago • 123
Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi Paper • 2504.06011 • Published Apr 8 • 1
Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi Paper • 2504.06011 • Published Apr 8 • 1
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • 16 days ago • 136