HuggingFaceTB's Collections
LLM pretraining datasets
SmolLM2 Intermediate Checkpoints
SmolVLM2: Smallest video LM ever
The Ultimate Collection of Code Classifiers
SmolVLM 256M & 500M
SmolLM2
FineMath
SmolVLM
Local SmolLMs
SmolLM
Instruct datasets
Cosmopedia
Find textbooks in FineWeb with a classifier
FineWeb clustering & synthetic generations
Other: Stanford, OpenStax, khanAcademy, wikihow...
FW generation prompts
Wikipedia Science topics
Wikipedia textbooks
SFT Experiments
Decay mixture experiments
models
models (updated Feb 20)
openai-community/gpt2 • Text Generation • Updated Feb 19, 2024 • 16.8M downloads • 2.65k likes
openai-community/gpt2-medium • Text Generation • Updated Feb 19, 2024 • 449k downloads • 171 likes
openai-community/gpt2-xl • Text Generation • Updated Feb 19, 2024 • 479k downloads • 334 likes
karpathy/gpt2_1558M_final4_hf • Text Generation • Updated Jul 12, 2024 • 204 downloads • 5 likes
EleutherAI/pythia-160m • Text Generation • Updated Jul 9, 2023 • 106k downloads • 30 likes
Qwen/Qwen2-0.5B • Text Generation • Updated Oct 22, 2024 • 99.2k downloads • 138 likes