Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
HuggingFaceTB
/
cosmo2-tokenizer
like
1
Follow
Hugging Face TB Research
981
Transformers
HuggingFaceTB/cosmo2_training_data_subset_1M
Inference Endpoints
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
cosmo2-tokenizer
cosmo2-tokenizer
Tokenizer for the training of cosmo2. This tokenizer was trained on 1M samples from:
FineWeb-Edu 70%
Cosmopedia v2 15%
StarCoderData 8%
OpenWebMath 5%
StackOverFlow 2%
Downloads last month
-
Downloads are not tracked for this model.
How to track
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this modelβs pipeline type.
Spaces using
HuggingFaceTB/cosmo2-tokenizer
19
π
Tousifahamed/SmolTextGen
π
EzhirkoArulmozhi/TextGeneratorSmolLM2
π
nishantb06/SmolLMTextGenerator-5k
π’
kalekarnn/SmolLM2-135-model
π’
Shilpaj/SmoLLMv2
π’
Shriti09/CustomSmol2TextGenerator
π»
Shriti09/Smol2TextGenerator
π
sudhakar272/SmolLM2-135TextGenerator
π»
Rajendro/SmalLMv2-TextGenerator
π»
Rakavi12/smolLm2-135M-replica
π
Tousifahamed/smol-lm2-demo
π¦
MilindChawre/SmolLM2-Text-Generator
+ 14 Spaces
+ 7 Spaces