Geometry-Editable and Appearance-Preserving Object Compositon Paper • 2505.20914 • Published 15 days ago • 6
FlexPainter: Flexible and Multi-View Consistent Texture Generation Paper • 2506.02620 • Published 8 days ago • 14
Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation Paper • 2506.04225 • Published 6 days ago • 24
LayerFlow: A Unified Model for Layer-aware Video Generation Paper • 2506.04228 • Published 6 days ago • 13
OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models Paper • 2506.03135 • Published 7 days ago • 37
BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models Paper • 2505.22865 • Published 13 days ago • 2
ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding Paper • 2506.01853 • Published 8 days ago • 28
DexUMI: Using Human Hand as the Universal Manipulation Interface for Dexterous Manipulation Paper • 2505.21864 • Published 14 days ago • 9
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation Paper • 2505.24521 • Published 12 days ago • 15
AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views Paper • 2505.23716 • Published 12 days ago • 31
Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence Paper • 2505.23747 • Published 12 days ago • 67
RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination Paper • 2505.21925 • Published 14 days ago • 35
MOOSE-Chem2: Exploring LLM Limits in Fine-Grained Scientific Hypothesis Discovery via Hierarchical Search Paper • 2505.19209 • Published 16 days ago • 24
InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning Paper • 2505.18291 • Published 18 days ago • 2
Diffusion Classifiers Understand Compositionality, but Conditions Apply Paper • 2505.17955 • Published 19 days ago • 21
MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback Paper • 2505.17873 • Published 19 days ago • 30
DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation Paper • 2505.18078 • Published 18 days ago • 6
Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention Paper • 2505.17412 • Published 19 days ago • 18