CARE-Edit: Condition-Aware Routing of Experts for Contextual Image Editing Paper • 2603.08589 • Published about 18 hours ago • 28
WildActor: Unconstrained Identity-Preserving Video Generation Paper • 2603.00586 • Published 10 days ago • 27
WildActor: Unconstrained Identity-Preserving Video Generation Paper • 2603.00586 • Published 10 days ago • 27 • 5
WildActor: Unconstrained Identity-Preserving Video Generation Paper • 2603.00586 • Published 10 days ago • 27
VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection Paper • 2603.00912 • Published 9 days ago • 35
VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection Paper • 2603.00912 • Published 9 days ago • 35
N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models Paper • 2512.16561 • Published Dec 18, 2025 • 20
N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models Paper • 2512.16561 • Published Dec 18, 2025 • 20
Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation Paper • 2512.10949 • Published Dec 11, 2025 • 46
Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation Paper • 2512.10949 • Published Dec 11, 2025 • 46
FlashVGGT: Efficient and Scalable Visual Geometry Transformers with Compressed Descriptor Attention Paper • 2512.01540 • Published Dec 1, 2025 • 5
FlashVGGT: Efficient and Scalable Visual Geometry Transformers with Compressed Descriptor Attention Paper • 2512.01540 • Published Dec 1, 2025 • 5
One4D: Unified 4D Generation and Reconstruction via Decoupled LoRA Control Paper • 2511.18922 • Published Nov 24, 2025 • 13
One4D: Unified 4D Generation and Reconstruction via Decoupled LoRA Control Paper • 2511.18922 • Published Nov 24, 2025 • 13
Mono4DGS-HDR: High Dynamic Range 4D Gaussian Splatting from Alternating-exposure Monocular Videos Paper • 2510.18489 • Published Oct 21, 2025 • 6
Mono4DGS-HDR: High Dynamic Range 4D Gaussian Splatting from Alternating-exposure Monocular Videos Paper • 2510.18489 • Published Oct 21, 2025 • 6
CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving Paper • 2510.07944 • Published Oct 9, 2025 • 25