ValueError: `rope_scaling` must be a dictionary with two fields

#131
by layor - opened

When deploying my fine-tuned model on a dedicated Hugging Face Inference Endpoint, this error is triggered:

ValueError: rope_scaling must be a dictionary with two fields, type and factor, got {'factor': 8.0, 'low_freq_factor': 1.0, 'high_freq_factor': 4.0, 'original_max_position_embeddings': 8192, 'rope_type': 'llama3'}
Application startup failed. Exiting.

I can't access the config.json file since it belongs to the base model, so there's no way to modify these values or to upgrade transformers (since it's deployed on Hugging Face's servers).

Any idea what to do?

Same issue!

Same here. Tried Llama 3.1 and it does not work.

pip install transformers -U

It works for me.

@yoo how can you run that on a dedicated Inference Endpoint when the endpoint cannot be deployed?

Has anyone found a solution? I am trying to deploy using Text Generation Inference.
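One workaround that may help when you cannot upgrade transformers on the endpoint: add a config.json to your own fine-tuned repo that overrides the inherited `rope_scaling` block with the legacy two-field schema the older library validates against. This is a sketch, not an official API — the `downgrade_rope_scaling` helper is hypothetical, and collapsing the Llama 3.1 frequency-aware scaling into plain `"dynamic"` scaling is an assumption that may degrade long-context quality:

```python
import json

def downgrade_rope_scaling(config: dict) -> dict:
    """Rewrite a llama3-style rope_scaling entry into the legacy
    {"type", "factor"} schema that pre-Llama-3.1 transformers expects.
    Hypothetical helper; returns a modified copy of the config dict."""
    rope = config.get("rope_scaling")
    if rope and "rope_type" in rope:  # new-style schema from Llama 3.1
        config = dict(config)  # shallow copy; leave the original untouched
        config["rope_scaling"] = {
            "type": "dynamic",         # legacy field name (assumed choice)
            "factor": rope["factor"],  # keep the original scaling factor
        }
    return config

# Example: the exact dict from the error message above
cfg = {"rope_scaling": {"factor": 8.0, "low_freq_factor": 1.0,
                        "high_freq_factor": 4.0,
                        "original_max_position_embeddings": 8192,
                        "rope_type": "llama3"}}
print(json.dumps(downgrade_rope_scaling(cfg)["rope_scaling"]))
```

You would run this on a downloaded copy of the base model's config.json and commit the result to the fine-tuned repo, so the endpoint picks up the rewritten config instead of the base model's.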
