Submitted by Nothing2Say 32 PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning · 6 authors 2
Submitted by CSJianYang 25 T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables · 15 authors 4
Submitted by sahsaeedi 15 How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench · 8 authors 2
Submitted by blaz-r 9 No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes · 3 authors 103 3
Submitted by RSW233 9 From reactive to cognitive: brain-inspired spatial intelligence for embodied agents · 7 authors 61 2
Submitted by Omartificial-Intelligence-Space 7 UI-Level Evaluation of ALLaM 34B: Measuring an Arabic-Centric LLM via HUMAIN Chat · 1 author 2
Submitted by Soontosh 2 Democracy-in-Silico: Institutional Design as Alignment in AI-Governed Polities · 2 authors 2