Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
khopilot
/
km-tokenizer-khmer
like
1
Feature Extraction
khmer-corpus-648mb
Khmer
sentencepiece
tokenizer
khmer
subword
text-generation
nlp
cambodia
southeast-asia
Eval Results
Carbon Emissions
License:
apache-2.0
Model card
Files
Files and versions
Community
main
km-tokenizer-khmer
Ctrl+K
Ctrl+K
1 contributor
History:
3 commits
khopilot
Upload tokenizer_config.json with huggingface_hub
cab1437
verified
13 days ago
.gitattributes
Safe
1.52 kB
initial commit
13 days ago
README.md
12.9 kB
Upload folder using huggingface_hub
13 days ago
config.json
93 Bytes
Upload folder using huggingface_hub
13 days ago
special_tokens_map.json
1.01 kB
Upload folder using huggingface_hub
13 days ago
spiece.model
164 kB
LFS
Upload folder using huggingface_hub
13 days ago
tokenizer.model
164 kB
LFS
Upload folder using huggingface_hub
13 days ago
tokenizer.vocab
169 kB
Upload folder using huggingface_hub
13 days ago
tokenizer_config.json
196 Bytes
Upload tokenizer_config.json with huggingface_hub
13 days ago