Teaching language models to think efficiently with Adaptive Length Penalty (ALP)
AI & ML interests
Scaling up good synthetic reasoning. Post-training and synthetic data research lab.
Organization Card
This collection contains assets associated with the Big-Math dataset, a high-quality collection of over 250,000 math questions with verifiable answers.
- SynthLabsAI/Big-Math-RL-Verified (251k • 5.57k • 185)
- SynthLabsAI/Big-Math-RL-UNVERIFIED (34.9k • 19 • 1)
- nlile/NuminaMath-1.5-RL-Verifiable (131k • 6.57k • 5)
- Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models (Paper • 2502.17387 • 6)
Datasets (6)
- SynthLabsAI/Big-Math-RL-Verified (251k • 5.57k • 185)
- SynthLabsAI/Big-Math-RL-UNVERIFIED (34.9k • 19 • 1)
- SynthLabsAI/PERSONA (200k • 3.64k • 16)
- SynthLabsAI/PERSONA_subset (5k • 3.51k • 1)
- SynthLabsAI/PRISM-Filter (3.87k • 7)
- SynthLabsAI/Synthetic-Personas (1k • 9 • 1)