Luca Soldaini
soldni
AI & ML interests
question answering, information retrieval, scientific document processing
Recent Activity
updated
a dataset
28 days ago
allenai/olmOCR-pes2o-0225
liked
a dataset
29 days ago
tokyotech-llm/swallow-code
updated
a dataset
about 1 month ago
nkandpa2/cccc_all_domains
Organizations
soldni's activity
Update README.md
#2 opened 3 months ago
by
reach-vb

Update README.md
#1 opened 3 months ago
by
reach-vb

Add library tag!
#1 opened 3 months ago
by
reach-vb

Upload folder using huggingface_hub
#1 opened 3 months ago
by
soldni

Fix loading and data viewer due to nested dirs
1
#3 opened 6 months ago
by
orionweller

Failed to load dataset
9
#3 opened 9 months ago
by
joelb

latest update?
2
#8 opened about 1 year ago
by
fkov
Seeing Arxiv content in the Algebraic Stack subset
3
#2 opened 9 months ago
by
dangerzone

Add `transformers` as library_name
#2 opened 9 months ago
by
Wauplin

How to run it on a mobile device?
👀
1
3
#1 opened 9 months ago
by
KoiSikhaDo
Add proper library name
#3 opened 9 months ago
by
osanseviero

accidentally released?
1
#1 opened 10 months ago
by
Fizzarolli

What is the total # tokens after sampling proportion? 1.7T or 1.65T
3
#36 opened about 1 year ago
by
ivanzhouyq

v1_7 update
#28 opened about 1 year ago
by
kylel

Does allenai/c4 and the subset C4 in allenai/dolma is the same dataset?
4
#10 opened about 1 year ago
by
speiqin
Can't download two files
1
#19 opened over 1 year ago
by
mrgorjan
Prompting to OLMo
2
#8 opened over 1 year ago
by
herambpatil2004
Update README.md
#10 opened over 1 year ago
by
Muennighoff

Add download instructions
#8 opened over 1 year ago
by
Muennighoff

Fix size
#9 opened over 1 year ago
by
Muennighoff
