WildLong: Synthesizing Realistic Long-Context Instruction Data at Scale Paper • 2502.16684 • Published Feb 23
Through the Valley: Path to Effective Long CoT Training for Small Language Models Paper • 2506.07712 • Published 18 days ago • 18
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning Paper • 2506.09513 • Published 17 days ago • 92
MS-DETR: Natural Language Video Localization with Sampling Moment-Moment Interaction Paper • 2305.18969 • Published May 30, 2023
Parameter-Efficient Conversational Recommender System as a Language Processing Task Paper • 2401.14194 • Published Jan 25, 2024
MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models Paper • 2406.13975 • Published Jun 20, 2024
CoIR: A Comprehensive Benchmark for Code Information Retrieval Models Paper • 2407.02883 • Published Jul 3, 2024
FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving Paper • 2502.20238 • Published Feb 27 • 24
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations Paper • 2504.13816 • Published Apr 18 • 17
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning Paper • 2506.07044 • Published 20 days ago • 105
DreamGen: Unlocking Generalization in Robot Learning through Neural Trajectories Paper • 2505.12705 • Published May 19
GR00T N1: An Open Foundation Model for Generalist Humanoid Robots Paper • 2503.14734 • Published Mar 18 • 3
Error Analyses of Auto-Regressive Video Diffusion Models: A Unified Framework Paper • 2503.10704 • Published Mar 12 • 5
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper • 2501.13106 • Published Jan 22 • 91