Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
HuggingFaceTB
/
cosmo2-tokenizer
like
3
Follow
Hugging Face Smol Models Research
1.8k
Transformers
HuggingFaceTB/cosmo2_training_data_subset_1M
Model card
Files
Files and versions
xet
Community
1
Train
Deploy
Use this model
cosmo2-tokenizer
cosmo2-tokenizer
Tokenizer for the training of cosmo2. This tokenizer was trained on 1M samples from:
FineWeb-Edu 70%
Cosmopedia v2 15%
StarCoderData 8%
OpenWebMath 5%
StackOverFlow 2%
Downloads last month
-
Downloads are not tracked for this model.
How to track
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support
Spaces using
HuggingFaceTB/cosmo2-tokenizer
30
π
ariG23498/nanovlm
π¦
MilindChawre/SmolLM2-Text-Generator
π
hashvibe007/smollm2
π
crpatel/SmolLMTextGenerator
π
crpatel/deepseek-v3-text-generation
π
Tousifahamed/SmolTextGen
π
Tousifahamed/smol-lm2-demo
π»
satyanayak/SmalLMv2-Text-Generator
π
nishantb06/SmolLM-Text-Generator
π
anjikum/generate_text_smollm2-135M_implementation
π
EzhirkoArulmozhi/TextGeneratorSmolLM2
π
nishantb06/SmolLMTextGenerator-5k
+ 25 Spaces
+ 18 Spaces