--- base_model: - Qwen/Qwen2.5-VL-3B-Instruct datasets: - WaltonFuture/Multimodal-Cold-Start - WaltonFuture/Multimodal-RL-Data license: apache-2.0 pipeline_tag: image-text-to-text library_name: transformers --- * 🐙 **GitHub Repo:** [waltonfuture/RL-with-Cold-Start](https://github.com/waltonfuture/RL-with-Cold-Start) * 📜 **Paper (arXiv):** [Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start (arXiv:2505.22334)](https://arxiv.org/abs/2505.22334)