Submitted by Weiyun1025 116 InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models · 47 authors 2
Submitted by LIKirin 68 PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters · 5 authors 3
Submitted by starriver030515 27 FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding · 7 authors 2
Submitted by wenhu 21 VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning · 6 authors 1
Submitted by DogNeverSleep 20 Mavors: Multi-granularity Video Representation for Multimodal Large Language Model · 15 authors 1
Submitted by xhluca 13 AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories · 10 authors 1
Submitted by AIRobotZ 11 S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models · 5 authors 2
Submitted by ztwang 11 DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training · 4 authors 1
Submitted by cuijiaxing 10 Have we unified image generation and understanding yet? An empirical study of GPT-4o's image generation ability · 3 authors 1
Submitted by parshinsh 6 LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models · 6 authors 1
Submitted by brucelyu 5 SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users · 21 authors 1
Submitted by leoozy 2 Breaking the Data Barrier -- Building GUI Agents Through Task Generalization · 7 authors 1
Submitted by ChrisJuan 2 EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental Health Safety · 10 authors 2
Submitted by mqliu 1 LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models · 11 authors 1
Submitted by codezakh 1 Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems · 5 authors 1
Submitted by LibraTree 1 VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search · 8 authors 3
Submitted by akhaliq - M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models · 6 authors 1