MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds Paper • 2508.14879 • Published 2 days ago • 37
LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos Paper • 2508.14041 • Published 3 days ago • 47
TexVerse: A Universe of 3D Objects with High-Resolution Textures Paper • 2508.10868 • Published 8 days ago • 14
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale Paper • 2508.10711 • Published 8 days ago • 134
CannyEdit: Selective Canny Control and Dual-Prompt Guidance for Training-Free Image Editing Paper • 2508.06937 • Published 13 days ago • 6
Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation Paper • 2508.07901 • Published 11 days ago • 38
Diffusion LLMs Can Do Faster-Than-AR Inference via Discrete Diffusion Forcing Paper • 2508.09192 • Published 15 days ago • 29
Matrix-3D: Omnidirectional Explorable 3D World Generation Paper • 2508.08086 • Published 11 days ago • 67
3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding Paper • 2507.23478 • Published 22 days ago • 15
SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation Paper • 2508.00782 • Published 21 days ago • 6
BANG: Dividing 3D Assets via Generative Exploded Dynamics Paper • 2507.21493 • Published 25 days ago • 61
Region-based Cluster Discrimination for Visual Representation Learning Paper • 2507.20025 • Published 27 days ago • 17
TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance Paper • 2507.18192 • Published 30 days ago • 7
MUR: Momentum Uncertainty guided Reasoning for Large Language Models Paper • 2507.14958 • Published Jul 20 • 46