Haoli Yin
Nano1337
AI & ML interests
Multimodal Learning, Data Curation
Recent Activity
upvoted
a
paper
5 days ago
BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale
Pretraining
new activity
about 2 months ago
MAmmoTH-VL/MAmmoTH-VL-Instruct-12M:Is it right that the uploaded json files are just the original seed data?
liked
a model
2 months ago
DatologyAI/retr-opt-vit-b-32