Unable to load models through transformers pipeline wrapper

#8
by adi751 - opened

Hi, I'm trying to test this model in Google Colab, but I'm unable to load it through the transformers pipeline wrapper.
Here's the code:

from transformers import pipeline

pipe = pipeline("image-text-to-text", model="OpenGVLab/InternVL2-2B", trust_remote_code=True)

Here's the error:

ValueError: Could not load model OpenGVLab/InternVL2-2B with any of the following classes: (<class 'transformers.models.auto.modeling_auto.AutoModelForImageTextToText'>,). See the original errors:

while loading with AutoModelForImageTextToText, an error is thrown:
Traceback (most recent call last):
  File "/usr/local/lib/python3.11/dist-packages/transformers/pipelines/base.py", line 291, in infer_framework_load_model
    model = model_class.from_pretrained(model, **kwargs)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/dist-packages/transformers/models/auto/auto_factory.py", line 574, in from_pretrained
    raise ValueError(
ValueError: Unrecognized configuration class <class 'transformers_modules.OpenGVLab.InternVL2-2B.e4f6747bd20f139e637642c6a058c6bd00b36919.configuration_internvl_chat.InternVLChatConfig'> for this kind of AutoModel: AutoModelForImageTextToText.
Model type should be one of AriaConfig, AyaVisionConfig, BlipConfig, Blip2Config, ChameleonConfig, Emu3Config, FuyuConfig, Gemma3Config, GitConfig, GotOcr2Config, IdeficsConfig, Idefics2Config, Idefics3Config, InstructBlipConfig, Kosmos2Config, Llama4Config, LlavaConfig, LlavaNextConfig, LlavaOnevisionConfig, Mistral3Config, MllamaConfig, PaliGemmaConfig, Pix2StructConfig, PixtralVisionConfig, Qwen2_5_VLConfig, Qwen2VLConfig, ShieldGemma2Config, SmolVLMConfig, UdopConfig, VipLlavaConfig, VisionEncoderDecoderConfig.

I have also tried upgrading the transformers library to the latest version, but I am still facing this issue. Please let me know the steps required to fix this.

Thanks!

Still persists for me as well.

I found that the pipeline-compatible versions of the InternVL models are published under repo names ending with -hf. For example, https://huggingface.co/OpenGVLab/InternVL3-8B-hf is the HF Transformers implementation of the original InternVL3-8B repo. Switching to the -hf checkpoint resolved the configuration class error for me.
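
For reference, here's a minimal sketch of the working call against the -hf checkpoint, assuming a recent transformers version. The chat-style message format follows the standard image-text-to-text pipeline usage, and the sample image URL is just an illustration, not something from this thread:

from transformers import pipeline

# The -hf repos ship a native Transformers implementation, so
# trust_remote_code is no longer needed.
pipe = pipeline("image-text-to-text", model="OpenGVLab/InternVL3-8B-hf")

# Images can be passed inline in the chat messages via a URL or local path.
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/bee.jpg"},
            {"type": "text", "text": "Describe this image."},
        ],
    }
]

print(pipe(text=messages, max_new_tokens=64))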
