Submitted by AlexiaJM 83 Less is More: Recursive Reasoning with Tiny Networks Samsung SAIT AI Lab, Montreal 1.02k 4
Submitted by jiaruz2 59 TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning Amazon 2
Submitted by Ogkunal 57 Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval and Synthesis for SLMs Fractal AI Research 12 1
Submitted by ZhuofengLi 31 In-the-Flow Agentic System Optimization for Effective Planning and Tool Use Stanford AI 16 1
Submitted by LHL3341 18 Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning · 8 authors 6 1
Submitted by xw-eric 10 Presenting a Paper is an Art: Self-Improvement Aesthetic Agents for Academic Presentations UC Santa Barbara NLP Group 1
Submitted by X-iZhang 10 CCD: Mitigating Hallucinations in Radiology MLLMs via Clinical Contrastive Decoding University of Glasgow 2 2
Submitted by domejiraphon 9 ShapeGen4D: Towards High Quality 4D Shape Generation from Videos · 8 authors 1
Submitted by JohnWeck 9 Discrete Diffusion Models with MLLMs for Unified Medical Multimodal Generation UCSC-VLAA 8 1
Submitted by yoavgur 8 Mixing Mechanisms: How Language Models Retrieve Bound Entities In-Context Tel Aviv University 1 1
Submitted by nielsr 8 OneFlow: Concurrent Mixed-Modal and Interleaved Generation with Edit Flows AI at Meta 3
Submitted by AdamF92 7 TensorBLEU: Vectorized GPU-based BLEU Score Implementation for Per-Sentence In-Training Evaluation Reactive AI 13 1
Submitted by gasolsun 7 GRACE: Generative Representation Learning via Contrastive Policy Optimization University of Illinois at Urbana-Champaign 1
Submitted by taesiri 6 HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video · 8 authors 10 1
Submitted by demfier 6 AInstein: Assessing the Feasibility of AI-Generated Approaches to Research Problems · 6 authors 3
Submitted by MikaStars39 5 Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning? rednote-hilab 4 1
Submitted by taesiri 5 LightCache: Memory-Efficient, Training-Free Acceleration for Video Generation · 8 authors 7 1
Submitted by sirano1004 5 Margin Adaptive DPO: Leveraging Reward Model for Granular Control in Preference Optimization · 1 authors 0 1
Submitted by NanHUO 5 BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions The BIRD Team 262 1
Submitted by chromeNLP 5 Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning · 6 authors 0 2
Submitted by nielsr 4 Equilibrium Matching: Generative Modeling with Implicit Energy-Based Models Massachusetts Institute of Technology 77 1
Submitted by soujanyaporia 4 Demystifying deep search: a holistic evaluation with hint-free multi-hop questions and factorised metrics Deep Cognition and Language Research (DeCLaRe) Lab 1
Submitted by taesiri 3 EgoNight: Towards Egocentric Vision Understanding at Night with a Challenging Benchmark · 12 authors 1
Submitted by amazingj 3 CARE: Cognitive-reasoning Augmented Reinforcement for Emotional Support Conversation Qwen DianJin 1
Submitted by gagan3012 2 Distributional Semantics Tracing: A Framework for Explaining Hallucinations in Large Language Models · 4 authors 1
Submitted by rgoswami 2 Adaptive Pruning for Increased Robustness and Reduced Computational Overhead in Gaussian Process Accelerated Saddle Point Searches · 2 authors 0 1
Submitted by minwoosun 2 No Tokens Wasted: Leveraging Long Context in Biomedical Vision-Language Models · 11 authors 1 1
Submitted by AmberYifan 2 DRIFT: Learning from Abundant User Dissatisfaction in Real-World Preference Learning · 8 authors 0 1
Submitted by swzwan 2 On Code-Induced Reasoning in LLMs Carnegie Mellon University Computer Science 1
Submitted by ayushzenith 1 SDQM: Synthetic Data Quality Metric for Object Detection Dataset Evaluation · 4 authors 2 1
Submitted by Itsuki-music 1 BACHI: Boundary-Aware Symbolic Chord Recognition Through Masked Iterative Decoding on Pop and Classical Music UCSD 1
Submitted by JonasGeiping 1 Training Dynamics Impact Post-Training Quantization Robustness · 3 authors 1
Submitted by taesiri 1 Deforming Videos to Masks: Flow Matching for Referring Video Segmentation · 9 authors 1
Submitted by joaompalmeiro 1 Benchmark It Yourself (BIY): Preparing a Dataset and Benchmarking AI Models for Scatterplot-Related Tasks · 4 authors 4 1
Submitted by liuganghuggingface 1 Scientific Algorithm Discovery by Augmenting AlphaEvolve with Deep Research · 4 authors 24 1
Submitted by glory-hyeok 1 Verifier-free Test-Time Sampling for Vision Language Action Models KAIST AI 2
Submitted by sirano1004 1 A Contextual Quality Reward Model for Reliable and Efficient Best-of-N Sampling · 1 authors 1
Submitted by DarshanDeshpande 1 MEMTRACK: Evaluating Long-Term Memory and State Tracking in Multi-Platform Dynamic Agent Environments Patronus AI 1
Submitted by chengyzhao - DYMO-Hair: Generalizable Volumetric Dynamics Modeling for Robot Hair Manipulation · 7 authors 1
Submitted by nazneen - The Valley of Code Reasoning: Scaling Knowledge Distillation of Large Language Models Collinear AI 1
Submitted by huangchengchou - Revisiting Modeling and Evaluation Approaches in Speech Emotion Recognition: Considering Subjectivity of Annotators and Ambiguity of Emotions National Tsing Hua University 1
Submitted by rachneetkaur - ChartAgent: A Multimodal Agent for Visually Grounded Reasoning in Complex Chart Question Answering · 5 authors 1
Submitted by lrsbrgrn - HalluGuard: Evidence-Grounded Small Reasoning Models to Mitigate Hallucinations in Retrieval-Augmented Generation · 4 authors 1