Spaces:
Running
Running
Update app.py
Browse files
app.py
CHANGED
|
@@ -42,14 +42,15 @@ training.
|
|
| 42 |
<img src="figs/framework.png" width="80%" />
|
| 43 |
</p>
|
| 44 |
""")
|
| 45 |
-
gr.Markdown(
|
| 46 |
"""
|
| 47 |
**Main Takeaways**
|
| 48 |
1. βοΈ Thinking + Solution are explicitly separated with independent budgets β boosting reliability under tight compute constraints.
|
| 49 |
2. π§ Budget-Constrained Rollout: We train models to handle truncated reasoning using GRPO.
|
| 50 |
3. π Flexible scalability: Robust performance across diverse inference budgets on reasoning benchmarks like AIME and LiveCodeBench.
|
| 51 |
4. βοΈ Better performance with fewer tokens: Our trained model generates outputs that are 30% shorter while maintaining (or even improving) accuracy.
|
| 52 |
-
|
|
|
|
| 53 |
<p align="center">
|
| 54 |
<img src="figs/aime.png" width="46%" />
|
| 55 |
<img src="figs/livecode.png" width="48%" />
|
|
@@ -59,7 +60,7 @@ gr.Markdown(
|
|
| 59 |
<img src="figs/codetable.png" width="90%" />
|
| 60 |
</p>
|
| 61 |
""")
|
| 62 |
-
gr.Markdown(
|
| 63 |
"""
|
| 64 |
## Citation
|
| 65 |
|
|
|
|
| 42 |
<img src="figs/framework.png" width="80%" />
|
| 43 |
</p>
|
| 44 |
""")
|
| 45 |
+
gr.Markdown(
|
| 46 |
"""
|
| 47 |
**Main Takeaways**
|
| 48 |
1. βοΈ Thinking + Solution are explicitly separated with independent budgets β boosting reliability under tight compute constraints.
|
| 49 |
2. π§ Budget-Constrained Rollout: We train models to handle truncated reasoning using GRPO.
|
| 50 |
3. π Flexible scalability: Robust performance across diverse inference budgets on reasoning benchmarks like AIME and LiveCodeBench.
|
| 51 |
4. βοΈ Better performance with fewer tokens: Our trained model generates outputs that are 30% shorter while maintaining (or even improving) accuracy.
|
| 52 |
+
""")
|
| 53 |
+
gr.HTML("""
|
| 54 |
<p align="center">
|
| 55 |
<img src="figs/aime.png" width="46%" />
|
| 56 |
<img src="figs/livecode.png" width="48%" />
|
|
|
|
| 60 |
<img src="figs/codetable.png" width="90%" />
|
| 61 |
</p>
|
| 62 |
""")
|
| 63 |
+
gr.Markdown(
|
| 64 |
"""
|
| 65 |
## Citation
|
| 66 |
|