arxiv:2501.08365
Sebastian Majstorovic PRO
storytracer
AI & ML interests
Open Data Specialist @EleutherAI
Recent Activity
authored
a paper
2 days ago
Towards Best Practices for Open Datasets for LLM Training
liked
a Space
about 1 month ago
opencompass/open_vlm_leaderboard
liked
a dataset
about 2 months ago
uonlp/CulturaX
Organizations
Papers
1
models
None public yet
datasets
9
storytracer/usgpo
Viewer
•
Updated
•
3.75M
•
3
storytracer/public_library_1929_dolma
Viewer
•
Updated
•
9.08k
•
70
storytracer/hathi_full_20240501
Viewer
•
Updated
•
18.4M
•
31
storytracer/hathi_pd_books_us_ia_2024-05-01
Viewer
•
Updated
•
225k
•
29
storytracer/openlibrary_dump_2024-04-30
Preview
•
Updated
•
68
storytracer/loc_books_dolma
Viewer
•
Updated
•
14.1k
•
39
storytracer/German-PD-Newspapers
Viewer
•
Updated
•
5.38M
•
191
•
4
storytracer/LoC-PD-Books
Viewer
•
Updated
•
16.5k
•
440
•
28
storytracer/US-PD-Books
Viewer
•
Updated
•
654k
•
233
•
181