eos_token should be <|eot_id|>

by AUTOMATIC - opened Apr 19, 2024

Apr 19, 2024

tokenizer_config.json should list "eos_token" as "<|eot_id|>", othwerwise the chat is spammed with .assistant things and never ends.

Apr 19, 2024

I had to change it in both tokenizer_config.json as well as in special_tokens_map.json.

Owner Apr 19, 2024

Is that the accepted fix? The files were just copied from the original Meta L3 files.

Apr 19, 2024

I don't believe so as changing this in exl2 quant affected the way model behaved and followed instructions

Apr 20, 2024

Apr 20, 2024

I've opened the pull request for this fix in #2, hope that it will be merged. Amazing model, shame that it has this tokenizer problem on the start.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment