PuzzleVQA: Diagnosing Multimodal Reasoning Challenges of Language Models with Abstract Visual Patterns Paper • 2403.13315 • Published Mar 20, 2024
Reasoning Paths Optimization: Learning to Reason and Explore From Diverse Paths Paper • 2410.10858 • Published Oct 7, 2024
Can-Do! A Dataset and Neuro-Symbolic Grounded Framework for Embodied Planning with Large Multimodal Models Paper • 2409.14277 • Published Sep 22, 2024