A collection of artifacts related to the Common Pile v0.1—an 8TB dataset of public domain and openly licensed text

Common Pile
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
Collections
4
datasets
62
common-pile/pre_1929_books
Viewer
•
Updated
•
134k
•
86
common-pile/stackv2_edu_filtered
Viewer
•
Updated
•
57M
•
184
common-pile/youtube_filtered
Viewer
•
Updated
•
986k
•
59
common-pile/wikiteam_filtered
Viewer
•
Updated
•
10.2M
•
70
common-pile/wikimedia_filtered
Viewer
•
Updated
•
12.9M
•
74
common-pile/uspto_filtered
Viewer
•
Updated
•
14.4M
•
424
common-pile/usgpo_filtered
Viewer
•
Updated
•
2.34M
•
60
common-pile/uk_hansard_filtered
Viewer
•
Updated
•
47.9k
•
63
common-pile/ubuntu_irc_filtered
Viewer
•
Updated
•
216k
•
47
common-pile/stackexchange_filtered
Viewer
•
Updated
•
27.5M
•
191
•
1