GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised Matching Paper • 2503.12720 • Published 7 days ago • 4
Long-Video Audio Synthesis with Multi-Agent Collaboration Paper • 2503.10719 • Published 10 days ago • 9
VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning Paper • 2503.13444 • Published 6 days ago • 13
Automated Movie Generation via Multi-Agent CoT Planning Paper • 2503.07314 • Published 13 days ago • 41
ObjectMover: Generative Object Movement with Video Prior Paper • 2503.08037 • Published 13 days ago • 4
AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models Paper • 2503.08417 • Published 12 days ago • 7
Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling Paper • 2503.08605 • Published 12 days ago • 24
PoseLess: Depth-Free Vision-to-Joint Control via Direct Image Mapping with VLM Paper • 2503.07111 • Published 13 days ago • 2
CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance Paper • 2503.10391 • Published 10 days ago • 10
ETCH: Generalizing Body Fitting to Clothed Humans via Equivariant Tightness Paper • 2503.10624 • Published 10 days ago • 7
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video Paper • 2503.11647 • Published 9 days ago • 115
VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control Paper • 2503.05639 • Published 16 days ago • 22
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models Paper • 2503.05638 • Published 16 days ago • 17
Layered Image Vectorization via Semantic Simplification Paper • 2406.05404 • Published Jun 8, 2024 • 3
NaturalL2S: End-to-End High-quality Multispeaker Lip-to-Speech Synthesis with Differential Digital Signal Processing Paper • 2502.12002 • Published Feb 17 • 1