Upload tokenizer
hello can this be merged?
Yes, it will. We're checking with the author who provided a transformers-compatible tokenizer in the discussions several days ago.
Yes, you can merge! As mentioned in another post, this tokenizer matches the original on the entire XNLI dataset (all languages)! This PR also adds the slow tokenizer in case a user wants to fall back on it.
Thank you @ArthurZ. I've merged the PR and tested it. It works pretty well!
hello can this be merged?
Hey @ehartford, the PR has been merged and you can now load the tokenizer directly like this:
from transformers import AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained("hpcai-tech/grok-1", trust_remote_code=True)
If you have downloaded the model as a repository, you might want to run git pull to get the updated tokenizer.
We have also updated the usage example in both the model card and our example in the ColossalAI GitHub repository.
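For anyone who wants to check the slow-tokenizer fallback mentioned above, here is a rough sketch rather than an official recipe. The helper names `load_grok_tokenizer` and `same_ids` are made up for illustration, and it assumes the repository honors the standard `use_fast` argument of `AutoTokenizer.from_pretrained`, which has not been confirmed in this thread.

```python
def load_grok_tokenizer(repo_id="hpcai-tech/grok-1", use_fast=True):
    """Load the tokenizer; use_fast=False requests the slow implementation.

    Assumption: the repo respects `use_fast`. The import is done lazily so
    this helper can be defined without transformers being installed.
    """
    from transformers import AutoTokenizer

    return AutoTokenizer.from_pretrained(
        repo_id, trust_remote_code=True, use_fast=use_fast
    )


def same_ids(tok_a, tok_b, texts):
    """Return True if both tokenizers give identical input_ids for every text."""
    return all(tok_a(t)["input_ids"] == tok_b(t)["input_ids"] for t in texts)


# Example usage (downloads the tokenizer files, so it needs network access):
#   fast = load_grok_tokenizer(use_fast=True)
#   slow = load_grok_tokenizer(use_fast=False)
#   print(same_ids(fast, slow, ["Hello world", "Bonjour le monde"]))
```

If the two ever disagree on your own text, that would be worth reporting in this discussion along with the input that triggered it.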