VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM Paper • 2501.00599 • Published Dec 31, 2024 • 48
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper • 2501.13106 • Published Jan 22 • 90
VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning Paper • 2507.22607 • Published 24 days ago • 45
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization Paper • 2507.14683 • Published Jul 19 • 126
SeaExam and SeaBench: Benchmarking LLMs with Local Multilingual Questions in Southeast Asia Paper • 2502.06298 • Published Feb 10 • 1
Large Language Models can Contrastively Refine their Generation for Better Sentence Representation Learning Paper • 2310.10962 • Published Oct 17, 2023
Finding the Sweet Spot: Preference Data Construction for Scaling Preference Optimization Paper • 2502.16825 • Published Feb 24 • 7
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations Paper • 2504.13816 • Published Apr 18 • 17
SOUL: Towards Sentiment and Opinion Understanding of Language Paper • 2310.17924 • Published Oct 27, 2023
MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback Paper • 2505.17873 • Published May 23 • 31
EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World? Paper • 2506.05287 • Published Jun 5 • 15
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning Paper • 2506.07044 • Published Jun 8 • 112
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning Paper • 2506.09513 • Published Jun 11 • 98
Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective Paper • 2506.17930 • Published Jun 22 • 19