Submitted by MingxingLi 43 UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning · 8 authors 4
Submitted by siyue 38 Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective · 6 authors 1
Submitted by Emaad 27 This Time is Different: An Observability Perspective on Time Series Foundation Models · 17 authors 1
Submitted by PeterV09 23 Learn to Reason Efficiently with Adaptive Length-based Reward Shaping · 8 authors 1
Submitted by Amanda2023 17 When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning · 9 authors 1
Submitted by knightnemo 17 Vid2World: Crafting Video Diffusion Models to Interactive World Models · 5 authors 1
Submitted by JamesMile 14 Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs · 11 authors 1
Submitted by TongZheng1999 11 Learning to Reason via Mixture-of-Thought for Logical Reasoning · 5 authors 1
Submitted by nonstopfor 11 How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study · 11 authors 1
Submitted by nonstopfor 10 Be Careful When Fine-tuning On Open-Source LLMs: Your Fine-tuning Data Could Be Secretly Stolen! · 6 authors 1
Submitted by yangjunxiao2021 10 BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs · 12 authors 1
Submitted by xw-eric 8 Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space · 8 authors 1
Submitted by sinwang 8 ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning · 5 authors 1
Submitted by IvanTang 6 AutoMat: Enabling Automated Crystal Structure Reconstruction from Microscopy via Agentic Tool Use · 17 authors 1
Submitted by Ziruibest 5 Evaluate Bias without Manual Test Sets: A Concept Representation Perspective for LLMs · 9 authors 1
Submitted by yanyc 4 VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models · 12 authors 1
Submitted by huangsiteng 3 VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL · 7 authors 1
Submitted by Ziruibest 3 Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models · 12 authors 1
Submitted by Mellen 3 PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration · 3 authors 1
Submitted by sunshinekevin 3 RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning · 6 authors 1
Submitted by craigwu 2 Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM · 3 authors 1
Submitted by zxbsmk 2 WebNovelBench: Placing LLM Novelists on the Web Novel Distribution · 3 authors 1
Submitted by bytehxf 2 DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling · 6 authors 1
Submitted by hisoka94 1 Scaling and Enhancing LLM-based AVSR: A Sparse Mixture of Projectors Approach · 5 authors 1
Submitted by ernlavr 1 MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM Hallucinations · 4 authors 1
Submitted by shainaraza 1 HumaniBench: A Human-Centric Framework for Large Multimodal Models Evaluation · 8 authors 1
Submitted by yapeichang 1 BLEUBERI: BLEU is a surprisingly effective reward for instruction following · 7 authors 1
Submitted by Fengzhuo - BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms · 9 authors 1
Submitted by shivamag99 - The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning · 5 authors 1
Submitted by ishikaa - Language Specific Knowledge: Do Models Know Better in X than in English? · 3 authors 1
Submitted by NathanRoll - In-Context Learning Boosts Speech Recognition via Human-like Adaptation to Speakers and Language Varieties · 6 authors 1