Submitted by Nothing2Say 17 PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning · 6 authors 1
Submitted by CSJianYang 6 T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables · 15 authors 1
Submitted by blaz-r 4 No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes · 3 authors 78 1
Submitted by Omartificial-Intelligence-Space 3 UI-Level Evaluation of ALLaM 34B: Measuring an Arabic-Centric LLM via HUMAIN Chat · 1 authors 1
Submitted by RSW233 3 From reactive to cognitive: brain-inspired spatial intelligence for embodied agents · 7 authors 18 1
Submitted by sahsaeedi 2 How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench · 8 authors 1