G-CUT3R: Guided 3D Reconstruction with Camera and Depth Prior Integration Paper • 2508.11379 • Published 9 days ago • 12
Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models Paper • 2508.12945 • Published 5 days ago • 12
Representing Speech Through Autoregressive Prediction of Cochlear Tokens Paper • 2508.11598 • Published 8 days ago • 16
Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model Paper • 2508.13009 • Published 5 days ago • 21
HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds Paper • 2508.12782 • Published 6 days ago • 24
Has GPT-5 Achieved Spatial Intelligence? An Empirical Study Paper • 2508.13142 • Published 5 days ago • 31
When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs Paper • 2508.11383 • Published 9 days ago • 38
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models Paper • 2508.09834 • Published 10 days ago • 46
Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds Paper • 2508.14892 • Published 3 days ago • 4
"Does the cafe entrance look accessible? Where is the door?" Towards Geospatial AI Agents for Visual Inquiries Paper • 2508.15752 • Published 2 days ago • 5
aiXiv: A Next-Generation Open Access Ecosystem for Scientific Discovery Generated by AI Scientists Paper • 2508.15126 • Published 3 days ago • 14
ATLAS: Decoupling Skeletal and Shape Parameters for Expressive Parametric Human Modeling Paper • 2508.15767 • Published 2 days ago • 8
Visual Autoregressive Modeling for Instruction-Guided Image Editing Paper • 2508.15772 • Published 2 days ago • 7
SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass Paper • 2508.15769 • Published 2 days ago • 13
Mobile-Agent-v3: Foundamental Agents for GUI Automation Paper • 2508.15144 • Published 3 days ago • 44