Transformers
Vietnamese
English

Tokenizer only

This tokenizer was trained on the Vietnamese (vi) and English (en) subsets of thng292/fineweb-subset-1M.

Vocab size: 65536
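
A minimal sketch of how the tokenizer might be loaded and checked, assuming the repository ships files that `AutoTokenizer` from the `transformers` library can read (the page is tagged Transformers, but this is not confirmed here). The helper names `load_tokenizer` and `tokens_per_word` are hypothetical and exist only for illustration; `tokens_per_word` is a rough way to compare how compactly Vietnamese and English text are encoded.

```python
# Repo id and vocab size are taken from this model card.
REPO_ID = "thng292/fineweb-vi-en-tokenizer"
EXPECTED_VOCAB_SIZE = 65536


def load_tokenizer(repo_id=REPO_ID):
    """Load the tokenizer from the Hugging Face Hub (hypothetical helper).

    Assumes `transformers` is installed and the repo is readable by
    AutoTokenizer; the import is kept local so the rest of this module
    works without the library.
    """
    from transformers import AutoTokenizer

    return AutoTokenizer.from_pretrained(repo_id)


def tokens_per_word(tokenizer, text):
    """Return the average number of tokens per whitespace-separated word.

    Works with any object exposing `encode(text, add_special_tokens=...)`.
    """
    words = text.split()
    ids = tokenizer.encode(text, add_special_tokens=False)
    return len(ids) / max(len(words), 1)


# Example usage (requires network access and `transformers`):
#   tok = load_tokenizer()
#   print(len(tok))  # may differ from EXPECTED_VOCAB_SIZE if tokens were added
#   print(tokens_per_word(tok, "Xin chào thế giới"))
#   print(tokens_per_word(tok, "Hello world"))
```

Since this repository contains a tokenizer only, there are no model weights to load; the tokenizer would have to be paired with a model trained against the same vocabulary.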


Dataset used to train thng292/fineweb-vi-en-tokenizer: thng292/fineweb-subset-1M