---
license: apache-2.0
datasets:
- HaoyeZhang/RLAIF-V-Dataset
language:
- en
---

# Model Card for RLAIF-V

[GitHub](https://github.com/RLHF-V/RLAIF-V)

RLAIF-V is a framework that aligns multimodal large language models (MLLMs) in a fully open-source paradigm, targeting trustworthiness that surpasses GPT-4V. RLAIF-V makes the most of open-source feedback from two key perspectives: high-quality feedback data and an online feedback learning algorithm.


## Model Details

### Model Description
- **Trained from model:** [llava-v1.5-7B](https://huggingface.co/liuhaotian/llava-v1.5-7b)
- **Trained on data:** [RLAIF-V-Dataset](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset)