ValueError: InternVLForConditionalGeneration has no vLLM implementation and the Transformers implementation is not compatible with vLLM.
#3 opened by yang396409487
When I use vLLM for inference, I get this error.
I noticed that the architectures entry of the HF version differs from that of the base version (e.g. [OpenGVLab/InternVL2_5-8B-MPO]): the base version's config.json lists InternVLChatModel.
How can I solve this?
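For reference, the mismatch can be seen by comparing the architectures field of each repo's config.json. A minimal sketch below; the config snippets are illustrative fragments (only the architectures values come from the error message and the repos mentioned above, the rest of each config is omitted):

```python
import json

# Illustrative, trimmed config.json fragments. The "architectures" values
# reflect what the question reports for the base and -hf repos; everything
# else in the real configs is omitted here.
base_config = json.loads('{"architectures": ["InternVLChatModel"]}')
hf_config = json.loads('{"architectures": ["InternVLForConditionalGeneration"]}')

def architecture(cfg: dict) -> str:
    """Return the first entry of a config's architectures list."""
    return cfg["architectures"][0]

print(architecture(base_config))  # InternVLChatModel
print(architecture(hf_config))    # InternVLForConditionalGeneration
```

vLLM picks its model implementation from this field, which is why the two repos behave differently even though they hold the same weights.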
model: OpenGVLab/InternVL2_5-8B-MPO-hf
vllm: 0.7.2
torch: 2.5.1
I also tried the versions below, but they do not work either:
vllm: 0.8.5
torch: 2.6.0