freesky/InternVL-Chat-V1-5_ft_by_DecoVQAplus

Visual Question Answering

freesky committed on Oct 1, 2024

Commit 8d9ae23 · verified · 1 Parent(s): ede8620

Update README.md

Files changed (1)

README.md +22 -3

@@ -1,3 +1,22 @@
----
-license: mit
----

+---
+license: mit
+language:
+- en
+base_model:
+- OpenGVLab/InternVL-Chat-V1-5
+---
+## Citation
+If you use this finetuned model checkpoint in your research, please cite our paper as follows:
+```bibtex
+@misc{zhang2024visualquestiondecompositionmultimodal,
+      title={Visual Question Decomposition on Multimodal Large Language Models},
+      author={Haowei Zhang and Jianzhe Liu and Zhen Han and Shuo Chen and Bailan He and Volker Tresp and Zhiqiang Xu and Jindong Gu},
+      year={2024},
+      eprint={2409.19339},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2409.19339},
+}
+```