metadata
base_model:
- Qwen/Qwen2.5-VL-3B-Instruct
datasets:
- WaltonFuture/Multimodal-Cold-Start
- WaltonFuture/Multimodal-RL-Data
license: apache-2.0
pipeline_tag: image-text-to-text
library_name: transformers
- ๐ GitHub Repo: waltonfuture/RL-with-Cold-Start
- ๐ Paper (arXiv): Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start (arXiv:2505.22334)