FineInstructions Pretraining Corpora
Recent Activity
AjayP13 updated a dataset 12 minutes ago: fineinstructions-pretraining/nemotron_fineinstructions_1T
AjayP13 updated a dataset 35 minutes ago: fineinstructions-pretraining/nemotron_fineinstructions_1T
craffel authored a paper 3 months ago: FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
Team members: 2
fineinstructions-pretraining's datasets: 33
fineinstructions-pretraining/longform_synthetic_all • Updated Aug 11 • 27.7k • 13
fineinstructions-pretraining/longform_actual_all • Updated Aug 11 • 27.7k • 21
fineinstructions-pretraining/ipt_fineinstructions_all_raw_0 • Updated Aug 11 • 106M • 1.28k