Accelerating Diffusion LLMs via Adaptive Parallel Decoding Paper • 2506.00413 • Published 7 days ago • 6
Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning Paper • 2506.03136 • Published 3 days ago • 22
Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers Paper • 2506.03065 • Published 3 days ago • 27
MotionSight: Boosting Fine-Grained Motion Understanding in Multimodal LLMs Paper • 2506.01674 • Published 5 days ago • 24
OThink-R1: Intrinsic Fast/Slow Thinking Mode Switching for Over-Reasoning Mitigation Paper • 2506.02397 • Published 4 days ago • 33
VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments Paper • 2506.02387 • Published 4 days ago • 55
ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding Paper • 2506.01853 • Published 4 days ago • 27
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published 4 days ago • 128
SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning Paper • 2506.01713 • Published 4 days ago • 30
OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning Paper • 2506.00338 • Published 7 days ago • 8
From Guidelines to Practice: A New Paradigm for Arabic Language Model Evaluation Paper • 2506.01920 • Published 4 days ago • 4
MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability Paper • 2505.20285 • Published 11 days ago • 3
Aligning VLM Assistants with Personalized Situated Cognition Paper • 2506.00930 • Published 6 days ago • 2
How Programming Concepts and Neurons Are Shared in Code Language Models Paper • 2506.01074 • Published 5 days ago • 3
OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions Paper • 2505.21724 • Published 10 days ago • 4
MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning Paper • 2505.24846 • Published 7 days ago • 15
ARIA: Training Language Agents with Intention-Driven Reward Aggregation Paper • 2506.00539 • Published 7 days ago • 26