Audio-Text-to-Text
Transformers
Safetensors
qwen2_audio
text2text-generation
Inference Endpoints
frankenliu commited on
Commit
d0ac662
·
verified ·
1 Parent(s): 8805704

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -13,7 +13,7 @@ pipeline_tag: audio-text-to-text
13
 
14
  R1-AQA is a audio question answering (AQA) model based on `Qwen2-Audio-7B-Instruct`, optimized through reinforcement learning using the group relative policy optimization (GRPO) algorithm.
15
  This implementation has achieved state-of-the-art performance on MMAU *Test-mini* benchmark with only 38k post-training samples.
16
- For more details, please refer to our [Github](https://github.com/xiaomi/r1-aqa) and [Technical Report](https://arxiv.org/abs/2503.11197).
17
 
18
  ### Table: Accuracies (%) on MMAU Test-mini benchmark
19
  | Model | Method | Sound | Music | Speech | Average |
 
13
 
14
  R1-AQA is a audio question answering (AQA) model based on `Qwen2-Audio-7B-Instruct`, optimized through reinforcement learning using the group relative policy optimization (GRPO) algorithm.
15
  This implementation has achieved state-of-the-art performance on MMAU *Test-mini* benchmark with only 38k post-training samples.
16
+ For more details, please refer to our [Github](https://github.com/xiaomi-research/r1-aqa) and [Technical Report](https://arxiv.org/abs/2503.11197).
17
 
18
  ### Table: Accuracies (%) on MMAU Test-mini benchmark
19
  | Model | Method | Sound | Music | Speech | Average |