Issue with tokenizer_config.json

#140

by LucaF - opened 2 days ago

Discussion

LucaF

2 days ago

•

edited 2 days ago

Hello!
In the file tokenizer_config.json, there is a very large value for model_max_length. I suppose it might be a typo?

  "bos_token": "<|startoftext|>",
  "clean_up_tokenization_spaces": false,
  "eos_token": "<|return|>",
  "extra_special_tokens": {},
  "model_input_names": [
    "input_ids",
    "attention_mask"
  ],
  "model_max_length": 1000000000000000019884624838656,
  "pad_token": "<|endoftext|>",
  "tokenizer_class": "PreTrainedTokenizerFast"
}

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment