Grid2Matrix: Revealing Digital Agnosia in Vision-Language Models Paper • 2604.09687 • Published Apr 14 • 8
Combee: Scaling Prompt Learning for Self-Improving Language Model Agents Paper • 2604.04247 • Published Apr 5 • 31
Can AI Agents Answer Your Data Questions? A Benchmark for Data Agents Paper • 2603.20576 • Published Mar 21 • 3
SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing Paper • 2603.08982 • Published Mar 9 • 16
V_1: Unifying Generation and Self-Verification for Parallel Reasoners Paper • 2603.04304 • Published Mar 4 • 14
SLA2: Sparse-Linear Attention with Learnable Routing and QAT Paper • 2602.12675 • Published Feb 13 • 59
AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions Paper • 2602.06008 • Published Feb 5 • 5
MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning Paper • 2512.16909 • Published Dec 18, 2025 • 3
R-KV: Redundancy-aware KV Cache Compression for Reasoning Models Paper • 2505.24133 • Published May 30, 2025 • 2
Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation Paper • 2312.16610 • Published Dec 27, 2023
DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering Paper • 2507.11527 • Published Jul 15, 2025 • 35
Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection Paper • 2503.12271 • Published Mar 15, 2025 • 9
Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control Paper • 2501.03847 • Published Jan 7, 2025 • 23
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows Paper • 2412.01169 • Published Dec 2, 2024 • 13