Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2140.8
TFLOPS
1205
76
77
Quentin Gallouédec
PRO
qgallouedec
Follow
Mkylmr's profile picture
Beegbrain's profile picture
mtglearn's profile picture
309 followers
·
264 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
AI & ML interests
None yet
Recent Activity
commented
on
their
article
about 7 hours ago
No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL
upvoted
an
article
about 14 hours ago
No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL
published
an
article
1 day ago
No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL
View all activity
Organizations
Articles
7
Article
17
No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL
Article
37
Gotchas in Tokenizer Behavior Every Developer Should Know
View all Articles
Papers
4
arxiv:
2402.09844
arxiv:
2402.03046
arxiv:
2208.14928
arxiv:
2106.13687
spaces
5
Sort: Recently updated
Sleeping
Tmp
🚀
Runtime error
2
Run Hello World
👀
Sleeping
Compute
👁
Runtime error
Run DuckDB Jobs
🦆
Process datasets with DuckDB SQL
Running
14
Train Memory
📈
Generate memory forecast for ML models
models
731
Sort: Recently updated
qgallouedec/Qwen3-0.6B-SFT
Updated
9 days ago
qgallouedec/Qwen2.5-0.5B-SFT
Updated
10 days ago
qgallouedec/SmolLM2-360M-Rickified-GRPO
Text Generation
•
Updated
13 days ago
•
54
•
1
qgallouedec/SmolLM2-360M-Rickified
Text Generation
•
Updated
14 days ago
•
625
qgallouedec/SmolLM2-360M-SFT
Text Generation
•
Updated
26 days ago
•
6
qgallouedec/R1-Zero-Qwen-7B-Math
Text Generation
•
Updated
May 1
•
123
qgallouedec/Qwen-2.5-7B-Simple-RL
Text Generation
•
Updated
Apr 8
•
10
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
Apr 7
•
18
qgallouedec/DeepSeek-R1-Distill-Qwen-7B-GRPO
Updated
Mar 26
qgallouedec/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
Mar 24
Expand 731 models
datasets
72
Sort: Recently updated
qgallouedec/trl-metrics
Viewer
•
Updated
7 days ago
•
120k
•
316
•
1
qgallouedec/rick-physics-grpo
Viewer
•
Updated
13 days ago
•
1.79k
•
197
•
1
qgallouedec/rick-science
Viewer
•
Updated
18 days ago
•
1.18k
•
192
•
1
qgallouedec/physics-problems
Viewer
•
Updated
25 days ago
•
247
•
56
qgallouedec/rick-teaches-math
Viewer
•
Updated
25 days ago
•
6.8k
•
103
qgallouedec/DAPO-Math-17k-Processed-Scored
Viewer
•
Updated
Apr 29
•
16.4k
•
91
•
2
qgallouedec/prm800k
Viewer
•
Updated
Dec 17, 2024
•
41.2k
•
54
•
3
qgallouedec/ultrafeedback-prompt
Viewer
•
Updated
Sep 9, 2024
•
60.9k
•
44
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
Sep 9, 2024
•
16.6k
•
33
qgallouedec/lm-human-preferences-descriptiveness
Viewer
•
Updated
Sep 9, 2024
•
6.26k
•
41
Expand 72 datasets