205516.8
TFLOPS
187
52
66
Leandro von Werra
lvwerra
321 followers
·
57 following
https://github.com/lvwerra
AI & ML interests
NLP and RL
Recent Activity
liked
a Space
about 21 hours ago
nanotron/ultrascale-playbook
published
a Space
about 21 hours ago
nanotron/ultrascale-playbook
new
activity
about 21 hours ago
nanotron/ultrascale-playbook:
fixes
Organizations
Articles
29
Article
47
DABStep: Data Agent Benchmark for Multi-step Reasoning
Article
284
Open-R1: Update #1
Papers
14
arxiv:2502.02737
arxiv:2501.08365
arxiv:2410.24198
arxiv:2406.17557
spaces
21
Sleeping
1
Executor
📚
Sleeping
3d Bench Viz
📈
Running
7
3d
🔥
Visualize 3D parallelism configuration
Running
10
Train LLMs
⚡
Calculate training cost and model efficiency
Sleeping
Text Source Viz
👁
Runtime error
20
Harm Space
⚡
models
33
lvwerra/the-tokenizer-v1
Updated
Feb 12, 2024
•
1
lvwerra/sc2
Updated
Feb 11, 2024
•
2
lvwerra/starcoder-98k-no-regex-no-digits
Updated
Sep 29, 2023
lvwerra/starcoder-393k
Updated
Sep 28, 2023
lvwerra/starcoder-196k
Updated
Sep 28, 2023
lvwerra/starcoder-98k
Updated
Sep 27, 2023
lvwerra/starcoder-24k
Updated
Sep 27, 2023
lvwerra/starcoder-12k
Updated
Sep 27, 2023
lvwerra/starcoder-6k
Updated
Sep 27, 2023
lvwerra/starcoderbase-gsm8k
Text Generation
•
Updated
Aug 30, 2023
•
20
datasets
22
lvwerra/dabstep
Viewer
•
Updated
16 days ago
•
3
•
3.02k
lvwerra/needle-llama3-16x524k
Viewer
•
Updated
Apr 26, 2024
•
1.41k
•
220
•
1
lvwerra/needle-llama3-16x65k
Viewer
•
Updated
Apr 26, 2024
•
1.41k
•
82
•
1
lvwerra/needle-llama3-16x8k
Viewer
•
Updated
Apr 26, 2024
•
1.41k
•
34
•
1
lvwerra/needle-llama3-16x512
Viewer
•
Updated
Apr 26, 2024
•
1.41k
•
31
•
1
lvwerra/admin
Viewer
•
Updated
Mar 6, 2024
•
1
•
295
lvwerra/stack-exchange-paired
Viewer
•
Updated
Mar 13, 2023
•
31.3M
•
1.93k
•
143
lvwerra/git-commits-clean
Updated
Mar 2, 2023
•
4
lvwerra/changeit
Viewer
•
Updated
Jan 8, 2023
•
31
•
126
lvwerra/code-ml
Viewer
•
Updated
Jan 4, 2023
•
1.5k
•
16