Update README.md
README.md CHANGED
@@ -22,11 +22,14 @@ Or check out our Spaces demo! [![Open in Spaces](https://huggingface.co/datasets
 LLaVA is an open-source chatbot trained by fine-tuning LLaMA/Vicuna on GPT-generated multimodal instruction-following data.
 It is an auto-regressive language model, based on the transformer architecture.
 
+ViP-LLaVA enhances the training protocol of LLaVA by marking images and interacting with the model using natural cues like a
+“red bounding box” or “pointed arrow” during training.
+
 **Model date:**
-
+ViP-LLaVA was released in December 2023.
 
 **Paper or resources for more information:**
-https://llava
+https://vip-llava.github.io/
 
 ## How to use the model
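The hunk ends at the "## How to use the model" heading without showing that section's body. For orientation, below is a minimal sketch of visual-prompt usage with Hugging Face Transformers, matching the markup-based interaction described in the added lines. The `llava-hf/vip-llava-7b-hf` checkpoint name, the prompt template, and the image URL are assumptions for illustration, not taken from this commit.

```python
import torch
import requests
from PIL import Image
from transformers import AutoProcessor, VipLlavaForConditionalGeneration

# Assumed checkpoint name; substitute the checkpoint this model card describes.
model_id = "llava-hf/vip-llava-7b-hf"

model = VipLlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)
processor = AutoProcessor.from_pretrained(model_id)

# ViP-LLaVA is trained on images annotated with visual markers, so the image
# here is assumed to already contain e.g. a red bounding box around the object
# of interest; the question then refers to that marker in natural language.
url = "https://example.com/image_with_red_box.jpg"  # placeholder URL
image = Image.open(requests.get(url, stream=True).raw)
question = "What is shown within the red bounding box?"

# Assumed single-turn prompt template with an <image> placeholder before the question.
prompt = (
    "A chat between a curious human and an artificial intelligence assistant. "
    "The assistant gives helpful, detailed, and polite answers to the human's questions."
    f"###Human: <image>\n{question}###Assistant:"
)

inputs = processor(text=prompt, images=image, return_tensors="pt").to(model.device, torch.float16)
output = model.generate(**inputs, max_new_tokens=100)
print(processor.decode(output[0], skip_special_tokens=True))
```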