Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning
-
Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning
Paper • 2506.09033 • Published • 6 -
ulab-ai/Router-R1-Qwen2.5-3B-Instruct
3B • Updated • 8 • 1 -
ulab-ai/Router-R1-Qwen2.5-3B-Instruct-Alpha0.9
3B • Updated • 4 -
ulab-ai/Router-R1-Llama-3.2-3B-Instruct
4B • Updated • 6