Submitted by Dongwei 52 Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback · 5 authors 3
Submitted by bracio9623 32 Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation · 7 authors 2
Submitted by wchai 20 LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming? · 19 authors 2
Submitted by LiuXR 20 Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache · 12 authors 4
Submitted by russwang 20 ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs · 13 authors 2
Submitted by cyrilzakka 14 Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards · 12 authors 2
Submitted by Ziruibest 14 SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning · 8 authors 2
Submitted by jinypark 10 DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO · 4 authors 2
Submitted by thomasschmied 8 pLSTM: parallelizable Linear Source Transition Mark networks · 5 authors 2
Submitted by cjeen 7 LoRA-Edit: Controllable First-Frame-Guided Video Editing via Mask-Aware LoRA Fine-Tuning · 6 authors 2
Submitted by kpzhang996 7 A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation · 11 authors 2
Submitted by yxK 7 SkillBlender: Towards Versatile Humanoid Whole-Body Loco-Manipulation via Skill Blending · 8 authors 2
Submitted by liranringel 6 Learning a Continue-Thinking Token for Enhanced Test-Time Scaling · 3 authors 2
Submitted by marksibrahim 6 AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions · 4 authors 2
Submitted by Splend1dchan 6 A Self-Refining Framework for Enhancing ASR Using TTS-Synthesized Data · 8 authors 2
Submitted by lxucs 6 Dense Retrievers Can Fail on Simple Queries: Revealing The Granularity Dilemma of Embeddings · 6 authors 2
Submitted by dawn0815 5 Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills · 8 authors 2
Submitted by ZacLiu 5 Infinity Instruct: Scaling Instruction Selection and Synthesis to Enhance Language Models · 8 authors 3
Submitted by bobxwu 4 Detecting Harmful Memes with Decoupled Understanding and Guided CoT Reasoning · 3 authors 2
Submitted by gabeorlanski 3 Reward Models Enable Scalable Code Verification by Trading Accuracy for Throughput · 4 authors 2
Submitted by ananthu-aniraj 3 Inherently Faithful Attention Maps for Vision Transformers · 4 authors 2
Submitted by MingxuanXia 3 Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data Annotation · 7 authors 2
Submitted by vicgalle 2 Configurable Preference Tuning with Rubric-Guided Synthetic Data · 1 authors 2