Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-Guided Region Control Paper • 2508.08134 • Published 12 days ago • 9 • 2
Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better Paper • 2506.09040 • Published Jun 10 • 35
Large Motion Video Autoencoding with Cross-modal Video VAE Paper • 2412.17805 • Published Dec 23, 2024 • 24
VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation Paper • 2412.02259 • Published Dec 3, 2024 • 61
FineCLIPER: Multi-modal Fine-grained CLIP for Dynamic Facial Expression Recognition with AdaptERs Paper • 2407.02157 • Published Jul 2, 2024
VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation Paper • 2412.02259 • Published Dec 3, 2024 • 61 • 5
Identity-Preserving Text-to-Video Generation by Frequency Decomposition Paper • 2411.17440 • Published Nov 26, 2024 • 38
OmniCreator: Self-Supervised Unified Generation with Universal Editing Paper • 2412.02114 • Published Dec 3, 2024 • 14