Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

llava-hf
/
llava-onevision-qwen2-0.5b-ov-hf

Image-Text-to-Text
Transformers
ONNX
Safetensors
Transformers.js
English
Chinese
llava_onevision
vision
conversational
Model card Files Files and versions Community
9
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

what's the difference between "ov" and "si" ?

#9 opened about 1 month ago by
cos0sin0

Can't reproduce given example (no meaningful output)

1
#8 opened 3 months ago by
pzarzycki

Error for fine tuning model when using FSDP: auto wrap: Could not find the transformer layer class LlavaOnevisionVisionAttention in the model.

1
#6 opened 6 months ago by
liuzijing2014

Error when attempting to run either model... ValueError: embed_dim must be divisible by num_heads (got `embed_dim`: 1152 and `num_heads`: 14).

3
#4 opened 8 months ago by
jdc4429

Download transformers for LlavaOnevisionForConditionalGeneration

2
#1 opened 9 months ago by
mjbooo
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs