Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning Paper • 2506.04034 • Published Jun 4 • 2
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated Apr 30 • 75
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection Paper • 2405.10300 • Published May 16, 2024 • 31