Submitted by Junteng 77 WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents · 15 authors 3
Submitted by Lingaaaaaaa 56 Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models · 6 authors 237 5
Submitted by taesiri 37 Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents · 4 authors 1.31k 7
Submitted by wenjun-li 31 Reinforcement Learning Foundations for Deep Research Systems: A Survey · 11 authors 30 2
Submitted by glecorve 24 DivMerge: A divergence-based model merging method for multi-tasking · 4 authors 2
Submitted by YuyaoGe 18 Focusing by Contrastive Attention: Enhancing VLMs' Visual Reasoning · 9 authors 2
Submitted by cxiong 17 SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents · 7 authors 2
Submitted by dorni 13 UniVerse-1: Unified Audio-Video Generation via Stitching of Experts · 10 authors 89 2
Submitted by taesiri 11 Scaling up Multi-Turn Off-Policy RL and Multi-Agent Tree Search for LLM Step-Provers · 5 authors 2
Submitted by lioooox 11 Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play? · 9 authors 17 2
Submitted by MElHuseyni 9 Guided Decoding and Its Critical Role in Retrieval-Augmented Generation · 7 authors 2
Submitted by JamesXZ 8 Test-Time Scaling in Reasoning Models Is Not Effective for Knowledge-Intensive Tasks Yet · 3 authors 9 2
Submitted by UVSKKR 5 D-HUMOR: Dark Humor Understanding via Multimodal Open-ended Reasoning · 6 authors 6 2
Submitted by stefan-it 5 Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian · 8 authors 2
Submitted by LuJingyi 5 Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping · 2 authors 41 2
Submitted by Youbang 3 R^textbf{2AI}: Towards Resistant and Resilient AI in an Evolving World · 5 authors 2
Submitted by sileod 2 Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem · 2 authors 24 2
Submitted by lgy0404 2 MAS-Bench: A Unified Benchmark for Shortcut-Augmented Hybrid Mobile GUI Agents · 11 authors 10 2
Submitted by TahaKoleilat 2 Singular Value Few-shot Adaptation of Vision-Language Models · 3 authors 7 2
Submitted by bearhaon 2 Mechanistic interpretability for steering vision-language-action models · 4 authors 2
Submitted by xchu123 1 DCReg: Decoupled Characterization for Efficient Degenerate LiDAR Registration · 6 authors 108 2