Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Vinko Sabolcec's picture
17 1 2

Vinko Sabolcec

vsabolcec
NXz64Fdf8Y's profile picture 21world's profile picture thomwolf's profile picture
·

AI & ML interests

None yet

Recent Activity

authored a paper about 17 hours ago
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
updated a dataset 4 months ago
epfml/FineWeb2-embedded
updated a dataset 4 months ago
epfml/FineWeb2-HQ
View all activity

Organizations

EPFL Machine Learning and Optimization Laboratory's profile picture FineData's profile picture mlo-data-cleaning's profile picture HuggingFaceFW-Dev's profile picture mlo-data-collab's profile picture mlo-mhq's profile picture

authored a paper about 17 hours ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published 2 days ago • 23
updated 5 datasets 4 months ago

epfml/FineWeb2-embedded

Viewer • Updated Feb 19 • 3.98B • 2.93k • 3

epfml/FineWeb2-HQ

Viewer • Updated Feb 19 • 380M • 1.92k • 11

epfml/FineWeb2-HQ

Viewer • Updated Feb 19 • 380M • 1.92k • 11

epfml/FineWeb2-embedded

Viewer • Updated Feb 19 • 3.98B • 2.93k • 3

epfml/FineWeb2-embedded

Viewer • Updated Feb 19 • 3.98B • 2.93k • 3
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs