Lewis Tunstall PRO
lewtun
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
updated
a Space
about 1 hour ago
open-r1/open-r1-eval-leaderboard
published
an
article
about 3 hours ago
Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques
updated
a Space
about 9 hours ago
open-r1/open-r1-eval-leaderboard
Organizations
lewtun's activity
Fix typo in dataset load
#3 opened 5 days ago
by
lewtun

Please add HF Inference Endpoint and library tags which allow easier deployment
1
#8 opened 7 days ago
by
SolshineMisfit

Mode changed to Model
2
#7 opened 7 days ago
by
Solshine

Update README.md
1
#6 opened 8 days ago
by
nickname100231
Omitted <think> at the start and almost 10k tokens to debug 2 JS functions
3
#2 opened 11 days ago
by
operationdarkside
It seems to overthink
1
#3 opened 8 days ago
by
sm54
Upload dataset
#4 opened 7 days ago
by
lewtun

missing </think> in all subset
2
#3 opened 7 days ago
by
volcanos

Why is there a discrepancy between the 'Solutions' subset and the 'Solutions_py' subset?
1
#2 opened 11 days ago
by
waple

Trouble loading the dataset
2
#2 opened 11 days ago
by
lewtun

Update README.md
1
#1 opened 12 days ago
by
lhoestq

Size of the weights > 140 GB for a 32 GB model?
1
#2 opened 12 days ago
by
stelterlab

Remove fp32 weights
#4 opened 12 days ago
by
lewtun

Remove fp32 weights
#3 opened 12 days ago
by
lewtun

[Paper review] Small Models Struggle to Learn from Strong Reasoners
#19 opened 29 days ago
by
lewtun

⚠️ Chat template foot gun with DeepSeek distilled models and RL format reward function
6
#17 opened about 1 month ago
by
lewtun

the finetune config of open-r1?
2
#6 opened about 1 month ago
by
MilyFang
Update README.md
3
#1 opened about 1 month ago
by
davidberenstein1957

[Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO
22
#15 opened about 1 month ago
by
lewtun

System Prompt
3
#3 opened 2 months ago
by
Wanfq
