
license: apache-2.0
datasets:
  - HaoyeZhang/RLAIF-V-Dataset
language:
  - en

Model Card for RLAIF-V

GitHub

RLAIF-V is a novel framework that aligns multimodal large language models (MLLMs) in a fully open-source paradigm, targeting trustworthiness that surpasses GPT-4V. RLAIF-V makes the most of open-source feedback from two key perspectives: high-quality feedback data and an online feedback learning algorithm.

Model Details

Model Description

Trained from model: llava-v1.5-7B
Trained on data: RLAIF-V-Dataset
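
The training data listed above can likely be inspected with the Hugging Face `datasets` library. The following is a minimal sketch, not the authors' training pipeline: the dataset ID is taken from the metadata of this card, while the helper name `load_rlaif_v`, the `train` split name, and the use of streaming mode are assumptions made for illustration.

```python
# Dataset ID as listed under `datasets:` in this card's metadata.
DATASET_ID = "HaoyeZhang/RLAIF-V-Dataset"


def load_rlaif_v(split="train", streaming=True):
    """Open the RLAIF-V feedback dataset (hypothetical helper).

    Assumptions: the dataset exposes a split named "train", and the
    `datasets` package is installed. Streaming avoids downloading the
    full dataset before the first record can be read.
    """
    # Imported lazily so that defining this helper needs no extra packages.
    from datasets import load_dataset

    return load_dataset(DATASET_ID, split=split, streaming=streaming)
```

With network access, `next(iter(load_rlaif_v()))` should return the first record as a dictionary; printing its keys is a quick way to check the field names before relying on them.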