R2R
Collection
Collections for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing"
•
4 items
•
Updated
This is the default router from the paper R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing.
Roads to Rome (R2R) is a neural token router that efficiently combines Large Language Models (LLMs) and Small Language Models (SLMs) by selectively routing only critical, reasoning-divergent tokens to the large model.
Please visit our GitHub repo for more information.
Base model
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B