Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Paper • 2506.01939 • Published • 128
Unofficial org for community upload of Mistral's Open Source models.
Set app_build_command: npm run build and app_file: build/index.html in your README's YAML block.
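As a rough sketch, the two settings would sit in the YAML block at the top of the README roughly as follows. Only app_build_command and app_file come from the text above; the sdk value is an assumption (a build command like this is typically used with a static Space), and any other metadata already in the block stays as it is.

```yaml
---
# Assumption: a static Space; not stated in the text above.
sdk: static
# Command run first to produce the files to serve.
app_build_command: npm run build
# Built entry point, relative to the repository root.
app_file: build/index.html
---
```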