Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models Paper • 2410.16163 • Published Oct 21, 2024 • 1
GThinker: Towards General Multimodal Reasoning via Cue-Guided Rethinking Paper • 2506.01078 • Published 10 days ago • 1
GThinker Collection GThinker is a strong MLLM skilled in multimodal reasoning across scenarios. • 4 items • Updated 8 days ago
GThinker Collection GThinker is a strong MLLM skilled in multimodal reasoning across scenarios. • 4 items • Updated 8 days ago
Vision-R1: Evolving Human-Free Alignment in Large Vision-Language Models via Vision-Guided Reinforcement Learning Paper • 2503.18013 • Published Mar 23 • 19