zhehuderek
/

qwen2_5_vl_3b_GEOQA_8K_hf

Image-Text-to-Text

text-generation-inference

Model card Files Files and versions Community

zhehuderek commited on Apr 9

Commit

73929f8

·

verified ·

1 Parent(s): 5b68752

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -8,7 +8,7 @@ tags: []
 - Base model: Qwen/Qwen2.5-VL-3B-Instruct
 - Training: GRPO with leonardPKU/GEOQA_8K_R1V
 - Training log on wandb: https://wandb.ai/ddderek-hk-polyu/easy_r1/runs/d1xtspm0
-- Not converged yet

 - Base model: Qwen/Qwen2.5-VL-3B-Instruct
 - Training: GRPO with leonardPKU/GEOQA_8K_R1V
 - Training log on wandb: https://wandb.ai/ddderek-hk-polyu/easy_r1/runs/d1xtspm0
+- Total step of 70, not converged yet