---
license: apache-2.0
datasets:
- HaoyeZhang/RLAIF-V-Dataset
language:
- en
---

# Model Card for RLAIF-V

[GitHub](https://github.com/RLHF-V/RLAIF-V)

RLAIF-V is a novel framework that aligns multimodal large language models (MLLMs) in a fully open-source paradigm, aiming for trustworthiness that surpasses GPT-4V. It exploits open-source feedback from two key perspectives: high-quality feedback data and an online feedback learning algorithm.

## Model Details

### Model Description

- **Trained from model:** [llava-v1.5-7B](https://huggingface.co/liuhaotian/llava-v1.5-7b)
- **Trained on data:** [RLAIF-V-Dataset](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset)