Peng Liu

P3ngLiu

AI & ML interests

CV, Multimodal, OVD

Recent Activity

Organizations

Om AI Lab's profile picture

P3ngLiu's activity

reacted to tianchez's post with ๐Ÿš€๐Ÿ‘ 21 days ago
view post
Post
4082
Introducing VLM-R1!

GRPO has helped DeepSeek R1 to learn reasoning. Can it also help VLMs perform stronger for general computer vision tasks?

The answer is YES and it generalizes better than SFT. We trained Qwen 2.5 VL 3B on RefCOCO (a visual grounding task) and eval on RefCOCO Val and RefGTA (an OOD task).

https://github.com/om-ai-lab/VLM-R1
ยท
New activity in omlab/omdet-turbo-swin-tiny-hf 3 months ago
New activity in omlab/OmDet-Turbo_tiny_SWIN_T 9 months ago

Update README.md

1
#1 opened 9 months ago by
nielsr