Smol, multilingual, long-context reasoner
AI & ML interests
Exploring smol models (for text, vision and video) and high quality web and synthetic datasets
Recent Activity
View all activity
Organization Card
Hugging Face Smol Models Research
This is the home for smol models (SmolLM & SmolVLM) and high quality pre-training datasets. We released:
- FineWeb-Edu: a filtered version of FineWeb dataset for educational content, paper available here.
- Cosmopedia: the largest open synthetic dataset, with 25B tokens and 30M samples. It contains synthetic textbooks, blog posts, and stories, posts generated by Mixtral. Blog post available here.
- Smollm-Corpus: the pre-training corpus of SmolLM: Cosmopedia v0.2, FineWeb-Edu dedup and Python-Edu. Blog post available here.
- FineMath: the best public math pretraining dataset with 50B tokens of mathematical and problem solving data.
- Stack-Edu: the best open code pretraining dataset with educational code in 15 programming languages.
- SmolLM2 models: a series of strong small models in three sizes: 135M, 360M and 1.7B
- SmolVLM2: a family of small Video and Vision models in three sizes: 2.2B, 500M and 256M. Blog post available here.
News 🗞️
- SmolLM3: SOTA 3B model with dual reasoning, supports 6 languages and long context with strong function calling: HuggingFaceTB/SmolLM3-3B
- SmolLM3 Engineering Blueprint available here.
spaces
15
Running
16
SmolLM3 WebGPU
🚀
A dual reasoning model that runs locally in your browser.
Running
on
Zero
79
SmolVLM
📊
Answer questions using images or videos
Running
28
WikiRacing Language Models
🏃
Find answers by racing against LLM in a quiz game
Running
36
SmolLM2 1.7B Instruct WebGPU
🚀
A blazingly fast & powerful AI chatbot that runs in-browser!
Running
52
SmolVLM 256M Instruct WebGPU
🐨
Generate descriptions for images using WebGPU technology
Running
4
Smolvlm Web Benchmarking
🌖
models
77

HuggingFaceTB/SmolLM3-3B-ONNX
Text Generation
•
Updated
•
232
•
12

HuggingFaceTB/SmolLM3-3B-Base
Text Generation
•
3B
•
Updated
•
3.14k
•
98

HuggingFaceTB/SmolLM3-3B
Text Generation
•
3B
•
Updated
•
27.1k
•
•
444

HuggingFaceTB/SmolLM2-360M-Instruct
Text Generation
•
0.4B
•
Updated
•
110k
•
133

HuggingFaceTB/SmolLM2-135M-Instruct
Text Generation
•
0.1B
•
Updated
•
323k
•
219

HuggingFaceTB/SmolLM2-1.7B-Instruct
Text Generation
•
2B
•
Updated
•
60.1k
•
656

HuggingFaceTB/SmolVLM2-2.2B-Base
Image-Text-to-Text
•
2B
•
Updated
•
1.14k
•
6

HuggingFaceTB/SmolVLM-256M-Instruct
Image-Text-to-Text
•
0.3B
•
Updated
•
86.4k
•
254

HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text
•
2B
•
Updated
•
148k
•
510

HuggingFaceTB/SmolVLM-500M-Instruct
Image-Text-to-Text
•
0.5B
•
Updated
•
28.1k
•
164
datasets
51
HuggingFaceTB/smoltalk2
Viewer
•
Updated
•
8.61M
•
1.23k
•
49
HuggingFaceTB/smollm3-configs
Updated
•
55
•
2
HuggingFaceTB/smollm3-blueprint
Viewer
•
Updated
•
1
•
269
•
4
HuggingFaceTB/images
Viewer
•
Updated
•
50
•
74.5k
•
1
HuggingFaceTB/smoltalk-multilingual8-Qwen3-32B-main-gen
Viewer
•
Updated
•
1.35M
•
137
HuggingFaceTB/MCQ_Wiki_-decontaminated_shard_2
Viewer
•
Updated
•
1.4M
•
116
HuggingFaceTB/MCQ_Wiki_decontamination_report_shard_2
Viewer
•
Updated
•
155
•
100
HuggingFaceTB/MCQ_Wiki_-decontaminated_shard_0
Viewer
•
Updated
•
1.4M
•
126
HuggingFaceTB/MCQ_Wiki_decontamination_report_shard_0
Viewer
•
Updated
•
174
•
99
HuggingFaceTB/Falcon-details
Viewer
•
Updated
•
1.32k
•
16