
VTOOL/VTOOL-R1-7B-F
Model weights for the paper "VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use"
We are training improved versions of our Table models; they will be available soon.
If you find our project helpful, please cite:
@misc{wu2025vtoolr1vlmslearnthink,
  title={VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use},
  author={Mingyuan Wu and Jingcheng Yang and Jize Jiang and Meitang Li and Kaizhuo Yan and Hanchao Yu and Minjia Zhang and Chengxiang Zhai and Klara Nahrstedt},
  year={2025},
  eprint={2505.19255},
  archivePrefix={arXiv},
  primaryClass={cs.LG},
  url={https://arxiv.org/abs/2505.19255}
}