Article KV Cache from scratch in nanoVLM By ariG23498 and 4 others • 7 days ago • 63
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published 8 days ago • 144
Article The Transformers Library: standardizing model definitions By lysandre and 3 others • 27 days ago • 113
Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • about 1 month ago • 428
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & outperforms all leading quantization methods. • 33 items • Updated about 10 hours ago • 117