Submitted by PhoenixZ 92 MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization · 14 authors 38 3
Submitted by taesiri 47 UniVideo: Unified Understanding, Generation, and Editing for Videos · 8 authors 2
Submitted by Blue-Giant 42 From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning · 5 authors 2
Submitted by yjyjyj98 38 Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning KAIST AI 2 2
Submitted by taesiri 36 VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning · 10 authors 2
Submitted by jackzhang 33 The Alignment Waltz: Jointly Training Agents to Collaborate for Safety AI at Meta 2
Submitted by Kylin-ll 26 Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense AI at Meta 2
Submitted by tqfang229 26 NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents · 13 authors 2
Submitted by UML 23 ARTDECO: Towards Efficient and High-Fidelity On-the-Fly 3D Reconstruction with Structured Scene Representation shanghai ailab 18 2
Submitted by tsq2000 21 DeepPrune: Parallel Scaling without Inter-trace Redundancy Knowledge Engineer Group @ Tsinghua University 9 2
Submitted by olafyiii 21 First Try Matters: Revisiting the Role of Reflection in Reasoning Models · 6 authors 2
Submitted by Foreshhh 18 LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions Fudan University 2
Submitted by YOKIMIYA 18 UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution Kuaishou Visual Generation and Interaction Center 3
Submitted by Changyao 17 NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints OpenGVLab 22 2
Submitted by xxyQwQ 15 CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards · 10 authors 4 2
Submitted by SoroushMehraban 15 PickStyle: Video-to-Video Style Transfer with Context-Style Adapters Pickford 2
Submitted by Carlanlarkk 15 Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward Tencent 4 2
Submitted by canqin001 13 UNIDOC-BENCH: A Unified Benchmark for Document-Centric Multimodal RAG Salesforce 4 4
Submitted by ZetangForward 12 LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling Soochow University 1 2
Submitted by Wayne-lc 9 Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks KnowledgeXLab@Shanghai AI Lab 2
Submitted by Guan123 9 Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction Apple 2
Submitted by Luo-Yihong 8 Reinforcing Diffusion Models by Direct Group Preference Optimization · 3 authors 8 2
Submitted by taesiri 7 SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models · 8 authors 5 3
Submitted by worstcoder 7 Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency · 10 authors 2
Submitted by xiangh 7 Beyond Turn Limits: Training Deep Search Agents with Dynamic Context Window · 14 authors 2
Submitted by ChonghuaLiao 6 Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints · 4 authors 8 2
Submitted by hyc2026 5 Memory Retrieval and Consolidation in Large Language Models through Function Tokens ByteDance Seed 2
Submitted by Mr-Philo 5 Recycling Pretrained Checkpoints: Orthogonal Growth of Mixture-of-Experts for Efficient Large Language Model Pre-Training · 8 authors 2
Submitted by Co2y 4 UP2You: Fast Reconstruction of Yourself from Unconstrained Photo Collections · 7 authors 43 3
Submitted by xymeow7 3 DexNDM: Closing the Reality Gap for Dexterous In-Hand Rotation via Joint-Wise Neural Dynamics Model · 3 authors 2
Submitted by zfj1998 3 A^2Search: Ambiguity-Aware Question Answering with Reinforcement Learning City University of Hong Kong 2 3
Submitted by lliutianc 3 OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment · 7 authors 2
Submitted by Franck-Dernoncourt 3 Learning to Route LLMs from Bandit Feedback: One Policy, Many Trade-offs · 7 authors 2
Submitted by jiahaoplus 2 Drive&Gen: Co-Evaluating End-to-End Driving and Video Generation Models · 14 authors 2
Submitted by xuxw98 1 R2RGEN: Real-to-Real 3D Data Generation for Spatially Generalized Manipulation · 7 authors 2
Submitted by andreasengelhardt 1 SViM3D: Stable Video Material Diffusion for Single Image 3D Generation Stability AI 2
Submitted by paischer101 1 GyroSwin: 5D Surrogates for Gyrokinetic Plasma Turbulence Simulations Johannes Kepler University 2
Submitted by ahmedhendawy19 1 Use the Online Network If You Can: Towards Fast and Stable Reinforcement Learning · 6 authors 2
Submitted by cfahlgren1 1 OmniRetarget: Interaction-Preserving Data Generation for Humanoid Whole-Body Loco-Manipulation and Scene Interaction · 9 authors 2
Submitted by ryancll118 1 Fidelity-Aware Data Composition for Robust Robot Generalization · 9 authors 2
Submitted by Saleh 1 Beyond Outliers: A Study of Optimizers Under Quantization Scalable Parallel Computing Laboratory (SPCL) 2
Submitted by ytgui - Search-R3: Unifying Reasoning and Embedding Generation in Large Language Models · 2 authors 5 2