Audio-Text-to-Text
Transformers
Safetensors
qwen2_audio
text2text-generation
Inference Endpoints
franken commited on
Commit
26598db
·
verified ·
1 Parent(s): 4011eae

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -10,7 +10,7 @@ tags: []
10
 
11
  ## Introduction
12
 
13
- R1-AQA is based on `Qwen2-Audio-7B-Instruc`, but applied group relative policy optimization (GRPO) algorithm to the Audio Question Answering(AQA) task.
14
  For more details, please refer to our [Github](https://github.com/xiaomi/r1-aqa) and [Report]().
15
 
16
 
 
10
 
11
  ## Introduction
12
 
13
+ R1-AQA extends `Qwen2-Audio-7B-Instruc` by integrating group relative policy optimization (GRPO). This adaptation enhances the model's capacity for temporal reasoning and contextual alignment in audio question answering (AQA) tasks.
14
  For more details, please refer to our [Github](https://github.com/xiaomi/r1-aqa) and [Report]().
15
 
16