khopilot/km-tokenizer-khmer
Tags: Feature Extraction, khmer-corpus-648mb, Khmer, sentencepiece, tokenizer, khmer, subword, text-generation, nlp, cambodia, southeast-asia, Eval Results, Carbon Emissions
License: apache-2.0
km-tokenizer-khmer/config.json (branch: main)
khopilot: Upload folder using huggingface_hub (f72f63a, verified, 13 days ago)
93 Bytes
{
  "tokenizer_class": "T5Tokenizer",
  "vocab_size": 8000,
  "model_type": "sentencepiece"
}
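
As a rough illustration of how a consumer might read this file, the sketch below parses the three fields with the Python standard library and checks their types. The config text is embedded so the example runs offline; the helper name `load_tokenizer_config` and the `EXPECTED_FIELDS` mapping are hypothetical and not part of the repository.

```python
import json

# Contents of config.json as listed above, embedded so the sketch runs offline.
CONFIG_TEXT = """{
  "tokenizer_class": "T5Tokenizer",
  "vocab_size": 8000,
  "model_type": "sentencepiece"
}"""

# Hypothetical schema for illustration: field name -> expected Python type.
EXPECTED_FIELDS = {
    "tokenizer_class": str,
    "vocab_size": int,
    "model_type": str,
}


def load_tokenizer_config(text):
    """Parse the config JSON and verify each expected field and its type."""
    config = json.loads(text)
    for key, expected_type in EXPECTED_FIELDS.items():
        if key not in config:
            raise KeyError(f"missing field: {key}")
        if not isinstance(config[key], expected_type):
            raise TypeError(f"{key} should be {expected_type.__name__}")
    return config


config = load_tokenizer_config(CONFIG_TEXT)
print(config["tokenizer_class"], config["vocab_size"], config["model_type"])
```

In practice the file would more likely be consumed indirectly, for example when the `transformers` library resolves `tokenizer_class` while loading the repository. That path needs network access and the SentencePiece model file, neither of which is shown in this listing, so it is left out of the sketch.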