LongAnimation: Long Animation Generation with Dynamic Global-Local Memory Paper • 2507.01945 • Published 6 days ago • 71
BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing Paper • 2506.17450 • Published 17 days ago • 60
RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation Paper • 2506.18088 • Published 16 days ago • 17
AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models Paper • 2506.19851 • Published 14 days ago • 56
Light of Normals: Unified Feature Representation for Universal Photometric Stereo Paper • 2506.18882 • Published 15 days ago • 84
Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details Paper • 2506.16504 • Published 19 days ago • 23
Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material Paper • 2506.15442 • Published 20 days ago • 12
ImmerseGen: Agent-Guided Immersive World Generation with Alpha-Textured Proxies Paper • 2506.14315 • Published 21 days ago • 10
BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models Paper • 2506.07961 • Published 29 days ago • 12
EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence Paper • 2506.10600 • Published 26 days ago • 7
Efficient Part-level 3D Object Generation via Dual Volume Packing Paper • 2506.09980 • Published 27 days ago • 8
Seedance 1.0: Exploring the Boundaries of Video Generation Models Paper • 2506.09113 • Published 28 days ago • 95
Geometry-Editable and Appearance-Preserving Object Compositon Paper • 2505.20914 • Published May 27 • 6
FlexPainter: Flexible and Multi-View Consistent Texture Generation Paper • 2506.02620 • Published Jun 3 • 14
Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation Paper • 2506.04225 • Published Jun 4 • 25
LayerFlow: A Unified Model for Layer-aware Video Generation Paper • 2506.04228 • Published Jun 4 • 13
OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models Paper • 2506.03135 • Published Jun 3 • 37
BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models Paper • 2505.22865 • Published May 28 • 2