Zery/Qwen2-VL-7B_visual_rft_lisa_IoU_reward Image-Text-to-Text β’ 8B β’ Updated Apr 2 β’ 1.86k β’ 5
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement Paper β’ 2503.06520 β’ Published Mar 9 β’ 11
view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr β’ Feb 7 β’ 209
Runtime error 72 72 VLM R1 Referral Expression π¬ Mark regions in images based on text descriptions
view article Article Fine tuning CLIP with Remote Sensing (Satellite) images and captions By arampacha and 5 others β’ Oct 13, 2021 β’ 7