-
Essential-Web v1.0: 24T tokens of organized web data
Paper • 2506.14111 • Published • 44 -
EssentialAI/essential-web-v1.0
Preview • Updated • 42.8k • 200 -
EssentialAI/eai-distill-0.5b
0.6B • Updated • 1.4k • 23 -
EssentialAI/eai-taxonomy-math-w-fm
Viewer • Updated • 21.6M • 1.4k • 5
AI & ML interests
None defined yet.
Organization Card
Essential AI
This is the home for models and data released by the Essential AI research team.
-
Essential-Web v1.0: 24T tokens of organized web data
Paper • 2506.14111 • Published • 44 -
EssentialAI/essential-web-v1.0
Preview • Updated • 42.8k • 200 -
EssentialAI/eai-distill-0.5b
0.6B • Updated • 1.4k • 23 -
EssentialAI/eai-taxonomy-math-w-fm
Viewer • Updated • 21.6M • 1.4k • 5
Datasets & Artifacts related to the paper "Rethinking Reflection in Pre-Training"
datasets
17
EssentialAI/eai-taxonomy-stem-w-dclm-100b-sample
Viewer
•
Updated
•
35.5M
•
420
•
4
EssentialAI/eai-taxonomy-stem-w-dclm
Preview
•
Updated
•
2.55k
•
5
EssentialAI/eai-taxonomy-med-w-dclm-100b-sample
Viewer
•
Updated
•
36.6M
•
831
•
2
EssentialAI/eai-taxonomy-med-w-dclm
Viewer
•
Updated
•
81.2M
•
818
•
8
EssentialAI/eai-taxonomy-code-w-dclm-100b-sample
Viewer
•
Updated
•
46.2M
•
319
•
2
EssentialAI/eai-taxonomy-code-w-dclm
Viewer
•
Updated
•
274M
•
10.5k
•
7
EssentialAI/eai-taxonomy-math-w-fm
Viewer
•
Updated
•
21.6M
•
1.4k
•
5
EssentialAI/essential-web-v1.0
Preview
•
Updated
•
42.8k
•
200
EssentialAI/reflection_model_outputs_run2
Updated
•
422
EssentialAI/reflection_model_outputs_run1
Preview
•
Updated
•
860