VaPR-LLaVA-v1.5-13B

This model is an artifact of the work VaPR – Vision-language Preference alignment for Reasoning (accepted at COLM 2025). 📂 Project Website

Base Model

Base Model: LLaVA v1.5 13B Instruct

Training Data

This model has been fine-tuned on the VaPR-30k subset dataset available at: VaPR-30k

Citation

If you use this model, please cite the following paper:

inproceedings{
wadhawan2025vapr,
title={Va{PR} - Vision-language Preference alignment for Reasoning},
author={Rohan Wadhawan and Fabrice Y Harel-Canada and Zi-Yi Dou and Suhaila Shakiah and Robinson Piramuthu and Nanyun Peng},
booktitle={Second Conference on Language Modeling},
year={2025},
url={https://openreview.net/forum?id=uBAubFwymy}
}

Downloads last month: 4

Safetensors

Model size

13B params

Tensor type

F32

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support