---
license: apache-2.0
datasets:
- HaoyeZhang/RLAIF-V-Dataset
language:
- en
---

# Model Card for RLAIF-V

[GitHub](https://github.com/RLHF-V/RLAIF-V)

RLAIF-V is a framework that aligns multimodal large language models (MLLMs) using only open-source feedback, with the goal of trustworthiness that surpasses GPT-4V. It exploits open-source feedback from two perspectives: high-quality feedback data and an online feedback learning algorithm.


## Model Details

### Model Description
- **Trained from model:** [llava-v1.5-7B](https://huggingface.co/liuhaotian/llava-v1.5-7b)
- **Trained on data:** [RLAIF-V-Dataset](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset)
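
As a minimal sketch of how the training data listed above might be inspected, the snippet below wraps the Hugging Face `datasets` loader around the dataset ID from this card. The helper name `load_rlaif_v_dataset` is hypothetical, and streaming mode is an assumption made here to avoid downloading the full dataset; no split or column names are assumed.

```python
# Dataset repository ID, taken from this card's metadata.
DATASET_ID = "HaoyeZhang/RLAIF-V-Dataset"


def load_rlaif_v_dataset(streaming: bool = True):
    """Load the RLAIF-V dataset from the Hugging Face Hub.

    Hypothetical helper for illustration. With no split given,
    `load_dataset` returns a dict-like object keyed by split name,
    so this sketch does not assume which splits or columns exist.
    """
    # Imported lazily so the module can be imported without `datasets`
    # installed; calling the function requires `pip install datasets`
    # and network access.
    from datasets import load_dataset

    return load_dataset(DATASET_ID, streaming=streaming)
```

Calling `load_rlaif_v_dataset()` and printing the result should list whatever splits the repository provides, which is a reasonable first step before relying on any particular field.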