L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
L3 Lab
university
AI & ML interests
None defined yet.
Recent Activity
View all activity
models
5

l3lab/L1-Qwen-1.5B-Exact
2B
•
Updated
•
12.8k
•
5

l3lab/L1-Qwen-1.5B-Max
2B
•
Updated
•
3.53k
•
15

l3lab/ntp-mathlib-context-deepseek-coder-1.3b
Text Generation
•
Updated
•
52
•
3

l3lab/ntp-mathlib-st-deepseek-coder-1.3b
Text Generation
•
Updated
•
31

l3lab/ntpctx-llama3-8b
Text Generation
•
Updated
•
46
•
3
datasets
7
l3lab/lean-premises
Updated
•
53
•
1
l3lab/miniCTX-v2
Viewer
•
Updated
•
668
•
36
•
1
l3lab/miniCTX
Viewer
•
Updated
•
662
•
1.49k
•
3
l3lab/ntp-mathlib-instruct-context-fullproof
Viewer
•
Updated
•
144k
•
84
•
1
l3lab/ntp-mathlib-instruct-context
Viewer
•
Updated
•
614k
•
122
•
1
l3lab/ntp-mathlib
Viewer
•
Updated
•
213k
•
88
•
2
l3lab/ntp-mathlib-instruct-st
Viewer
•
Updated
•
307k
•
85