DiffDecompose: Layer-Wise Decomposition of Alpha-Composited Images via Diffusion Transformers Paper • 2505.21541 • Published 18 days ago • 7
LayerFlow: A Unified Model for Layer-aware Video Generation Paper • 2506.04228 • Published 6 days ago • 13
Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control Paper • 2506.01943 • Published 8 days ago • 24
Temporal In-Context Fine-Tuning for Versatile Control of Video Diffusion Models Paper • 2506.00996 • Published 10 days ago • 35
MAGREF: Masked Guidance for Any-Reference Video Generation Paper • 2505.23742 • Published 12 days ago • 9
Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment Paper • 2505.18600 • Published 18 days ago • 45
Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs Paper • 2505.19075 • Published 17 days ago • 21
HoliTom: Holistic Token Merging for Fast Video Large Language Models Paper • 2505.21334 • Published 15 days ago • 19
Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation Paper • 2505.18875 • Published 17 days ago • 40
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data Paper • 2505.18445 • Published 18 days ago • 63
Model Already Knows the Best Noise: Bayesian Active Noise Selection via Attention in Video Diffusion Model Paper • 2505.17561 • Published 19 days ago • 30
RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers Paper • 2505.13344 • Published 23 days ago • 6
Training-Free Efficient Video Generation via Dynamic Token Carving Paper • 2505.16864 • Published 20 days ago • 21
Emerging Properties in Unified Multimodal Pretraining Paper • 2505.14683 • Published 21 days ago • 130
HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation Paper • 2504.21650 • Published Apr 30 • 15