Text Generation
Transformers
Safetensors
English
llava_llama
RLAIF-V-7B / README.md
XiaomanLu's picture
Update README.md
9a9c8e4 verified
|
raw
history blame
661 Bytes
metadata
license: apache-2.0
datasets:
  - HaoyeZhang/RLAIF-V-Dataset
language:
  - en

Model Card for RLAIF-V

GitHub ]

RLAIF-V is a novel framework that aligns MLLMs in a fully open-source paradigm for super GPT-4V trustworthiness. RLAIF-V maximally exploits the open-source feedback from two key perspectives, including high-quality feedback data and online feedback learning algorithm.

Model Details

Model Description