Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
thepowerfuldeez
's Collections
Coding datasets
Coding datasets
updated
Sep 19, 2025
Datasets for high quality small LM model pre-training.
Upvote
-
thepowerfuldeez/the-stack-v2-train-smol-ids-updated
Viewer
•
Updated
Sep 12, 2025
•
409k
•
15
thepowerfuldeez/the-stack-v2-train-smol-ids-updated-content
Viewer
•
Updated
Sep 16, 2025
•
387k
•
4
thepowerfuldeez/the-stack-v2-extra-python-content
Viewer
•
Updated
Oct 21, 2025
•
10.6k
•
20
Upvote
-
Share collection
View history
Collection guide
Browse collections