zhehuderek
/

qwen2_5_vl_3b_GEOQA_8K_hf

Image-Text-to-Text

text-generation-inference

Model card Files Files and versions Community

qwen2_5_vl_3b_GEOQA_8K_hf / README.md

zhehuderek's picture

Update README.md

73929f8 verified about 1 month ago

|

283 Bytes

metadata

library_name: transformers
tags: []

Model Card for Model ID

Base model: Qwen/Qwen2.5-VL-3B-Instruct
Training: GRPO with leonardPKU/GEOQA_8K_R1V
Training log on wandb: https://wandb.ai/ddderek-hk-polyu/easy_r1/runs/d1xtspm0
Total step of 70, not converged yet