Tony Zhao

tianchez

AI & ML interests

Multimodal Agent, Generative AI

Recent Activity

updated a model about 2 months ago
omlab/VLM-R1-Qwen2.5VL-3B-Math-0305
updated a model about 2 months ago
omlab/Qwen2.5VL-3B-VLM-R1-REC-500steps
View all activity

Organizations

Om AI Lab's profile picture

tianchez's activity

upvoted 2 articles 3 months ago
view article
Article

Improving Object Detection through Reinforcement Learning with VLM-R1

By omlab and 5 others
2
view article
Article

Trials, Errors, and Breakthroughs: Our Rocky Road to OVD SOTA with Reinforcement Learning

By omlab and 5 others
1
replied to AdinaY's post 3 months ago
replied to their post 4 months ago
reacted to their post with 👍 4 months ago
view post
Post
4438
Introducing VLM-R1!

GRPO has helped DeepSeek R1 to learn reasoning. Can it also help VLMs perform stronger for general computer vision tasks?

The answer is YES and it generalizes better than SFT. We trained Qwen 2.5 VL 3B on RefCOCO (a visual grounding task) and eval on RefCOCO Val and RefGTA (an OOD task).

https://github.com/om-ai-lab/VLM-R1
·
New activity in omlab/VLM-R1-Referral-Expression 4 months ago
reacted to their post with ❤️ 4 months ago
view post
Post
4438
Introducing VLM-R1!

GRPO has helped DeepSeek R1 to learn reasoning. Can it also help VLMs perform stronger for general computer vision tasks?

The answer is YES and it generalizes better than SFT. We trained Qwen 2.5 VL 3B on RefCOCO (a visual grounding task) and eval on RefCOCO Val and RefGTA (an OOD task).

https://github.com/om-ai-lab/VLM-R1
·