VaPR-LLaVA-v1.5-13B

This model is an artifact of the work VaPR – Vision-language Preference alignment for Reasoning (accepted at COLM 2025). 📂 Project Website

Base Model

Training Data

This model was fine-tuned on the VaPR-30k subset, available at: VaPR-30k

Citation

If you use this model, please cite the following paper:

@inproceedings{
wadhawan2025vapr,
title={Va{PR} - Vision-language Preference alignment for Reasoning},
author={Rohan Wadhawan and Fabrice Y Harel-Canada and Zi-Yi Dou and Suhaila Shakiah and Robinson Piramuthu and Nanyun Peng},
booktitle={Second Conference on Language Modeling},
year={2025},
url={https://openreview.net/forum?id=uBAubFwymy}
}
Model size: 13B params (Safetensors; tensor type F32)