based on phi-3 tokenizer, expanded 17291 tokens
Following The Optimal Vocabulary Size Predictor, I recommend using this tokenizer with 3-4B model such as phi-3-mini
based on phi-3 tokenizer, expanded 17291 tokens
Following The Optimal Vocabulary Size Predictor, I recommend using this tokenizer with 3-4B model such as phi-3-mini