Tristan/dclm-perplexity-correlations-1b-3-openbookqa-gs7 Text Generation • Updated about 1 month ago • 3
Tristan/RedPajama-Data-V2-sample-100B-filtered-shuffled-tokenized-with-token-counts Viewer • Updated May 31, 2024 • 4.16M • 165
Tristan/RedPajama-Data-V2-sample-100B-filtered-for-regression-domains-with-domains Viewer • Updated May 24, 2024 • 4.16M • 346