File size: 283 Bytes
bd77adb
 
 
 
 
 
 
5b68752
 
 
73929f8
bd77adb
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
---
library_name: transformers
tags: []
---

# Model Card for Model ID

- Base model: Qwen/Qwen2.5-VL-3B-Instruct
- Training: GRPO with leonardPKU/GEOQA_8K_R1V
- Training log on wandb: https://wandb.ai/ddderek-hk-polyu/easy_r1/runs/d1xtspm0
- Total step of 70, not converged yet