BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing Paper • 2506.17450 • Published 10 days ago • 32
Multi3DRefer: Grounding Text Description to Multiple 3D Objects Paper • 2309.05251 • Published Sep 11, 2023
Duoduo CLIP: Efficient 3D Understanding with Multi-View Images Paper • 2406.11579 • Published Jun 17, 2024
Generalizable Articulated Object Reconstruction from Casually Captured RGBD Videos Paper • 2506.08334 • Published 21 days ago
TieBot: Learning to Knot a Tie from Visual Demonstration through a Real-to-Sim-to-Real Approach Paper • 2407.03245 • Published Jul 3, 2024
HSM: Hierarchical Scene Motifs for Multi-Scale Indoor Scene Generation Paper • 2503.16848 • Published Mar 21
Duoduo CLIP: Efficient 3D Understanding with Multi-View Images Paper • 2406.11579 • Published Jun 17, 2024
NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes Paper • 2503.16375 • Published Mar 20 • 10
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks Paper • 2410.10563 • Published Oct 14, 2024 • 39
MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction Paper • 2402.12712 • Published Feb 20, 2024 • 18
MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping Paper • 2403.15951 • Published Mar 23, 2024 • 1
An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion Paper • 2408.03178 • Published Aug 6, 2024 • 41
An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion Paper • 2408.03178 • Published Aug 6, 2024 • 41
An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion Paper • 2408.03178 • Published Aug 6, 2024 • 41
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion Paper • 2307.01097 • Published Jul 3, 2023 • 10