Submitted by foggyforest 88 Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models · 22 authors 1
Submitted by scofield7419 59 On Path to Multimodal Generalist: General-Level and General-Bench · 32 authors 5
Submitted by vvibt 19 Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models · 13 authors 3
Submitted by akhaliq 12 Generating Physically Stable and Buildable LEGO Designs from Text · 6 authors 1
Submitted by WHB139426 10 StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant · 9 authors 1
Submitted by shengz 8 X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains · 12 authors 2
Submitted by arianhosseini 5 Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers · 5 authors 1
Submitted by dogtooth 5 SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning · 2 authors 1
Submitted by RanjanSapkota 4 Vision-Language-Action Models: Concepts, Progress, Applications and Challenges · 4 authors 1
Submitted by PALIN2018 4 BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese · 16 authors 1