Data shuffled only at the document-level

BabyLM Sequence Length
community
AI & ML interests
BabyLM 2025 paper submission
Recent Activity
View all activity
models
35

babylm-seqlen/opt-4096-warmup-test
Text Generation
•
Updated
•
4

babylm-seqlen/mamba-4096
Updated
•
237

babylm-seqlen/opt-8192
Updated
•
324

babylm-seqlen/opt-4096
Updated
•
208

babylm-seqlen/mamba-8192
Updated
•
257

babylm-seqlen/mamba-8192-warmup
Updated
•
147

babylm-seqlen/mamba-4096-warmup
Updated
•
122

babylm-seqlen/mamba-2048-warmup
Updated
•
20

babylm-seqlen/mamba-512-warmup
Updated
•
18

babylm-seqlen/mamba-1024-warmup
Updated
•
32
datasets
18
babylm-seqlen/train_100M_64
Viewer
•
Updated
•
2.56M
•
13
babylm-seqlen/train_100M_512_single_shuffle
Viewer
•
Updated
•
319k
•
17
babylm-seqlen/train_100M_8192_single_shuffle
Viewer
•
Updated
•
19.8k
•
18
babylm-seqlen/train_100M_2048_single_shuffle
Viewer
•
Updated
•
79.8k
•
17
babylm-seqlen/train_100M_16384_single_shuffle
Viewer
•
Updated
•
9.86k
•
11
babylm-seqlen/train_100M_4096_single_shuffle
Viewer
•
Updated
•
39.8k
•
23
babylm-seqlen/train_100M_256_single_shuffle
Viewer
•
Updated
•
639k
•
20
babylm-seqlen/train_100M_1024_single_shuffle
Viewer
•
Updated
•
160k
•
20
babylm-seqlen/train_100M_128_single_shuffle
Viewer
•
Updated
•
1.28M
•
21
babylm-seqlen/train_100M_64_single_shuffle
Viewer
•
Updated
•
2.56M
•
26