zhehuderek
/

qwen2_5_vl_3b_GEOQA_8K_hf

Image-Text-to-Text

text-generation-inference

Model card Files Files and versions Community

qwen2_5_vl_3b_GEOQA_8K_hf / README.md

zhehuderek's picture

Update README.md

73929f8 verified about 1 month ago

|

283 Bytes

	---
	library_name: transformers
	tags: []
	---

	# Model Card for Model ID

	- Base model: Qwen/Qwen2.5-VL-3B-Instruct
	- Training: GRPO with leonardPKU/GEOQA_8K_R1V
	- Training log on wandb: https://wandb.ai/ddderek-hk-polyu/easy_r1/runs/d1xtspm0
	- Total step of 70, not converged yet