babylm-seqlen's Collections
Double Shuffled Data
Updated 15 days ago
Data shuffled twice: once at the document level and again at the tokenized level.
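The collection does not say how the two shuffles are implemented. As one plausible reading, the minimal sketch below shuffles whole documents first, tokenizes and concatenates them, cuts the token stream into fixed-length sequences, and then shuffles those sequences. The function name `double_shuffle`, the whitespace tokenizer, and the choice to drop the trailing partial sequence are all assumptions made for illustration, not details taken from the datasets.

```python
import random


def double_shuffle(documents, seq_len, seed=0):
    """Hypothetical sketch: shuffle documents, then shuffle fixed-length token sequences."""
    rng = random.Random(seed)

    # First shuffle: reorder whole documents.
    docs = list(documents)
    rng.shuffle(docs)

    # Stand-in tokenizer (assumption): split on whitespace and concatenate.
    tokens = [tok for doc in docs for tok in doc.split()]

    # Cut the token stream into sequences of exactly seq_len tokens.
    # Dropping the trailing partial sequence is an assumption.
    n_full = len(tokens) // seq_len
    sequences = [tokens[i * seq_len:(i + 1) * seq_len] for i in range(n_full)]

    # Second shuffle: reorder the tokenized sequences.
    rng.shuffle(sequences)
    return sequences
```

Under this reading, `seq_len` would correspond to the numeric suffix in each dataset name (64 through 16384), and a fixed `seed` keeps the ordering reproducible.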
babylm-seqlen/train_100M_256 • Updated 15 days ago • 639k • 43
babylm-seqlen/train_100M_1024 • Updated 15 days ago • 160k • 46
babylm-seqlen/train_100M_16384 • Updated 15 days ago • 9.86k • 42
babylm-seqlen/train_100M_4096 • Updated 15 days ago • 39.8k • 37
babylm-seqlen/train_100M_512 • Updated 15 days ago • 319k • 34
babylm-seqlen/train_100M_8192 • Updated 15 days ago • 19.8k • 30
babylm-seqlen/train_100M_2048 • Updated 15 days ago • 79.8k • 32
babylm-seqlen/train_100M_128 • Updated 15 days ago • 1.28M • 50
babylm-seqlen/train_100M_64 • Updated 15 days ago • 2.56M • 44