UniVerse-1: Unified Audio-Video Generation via Stitching of Experts Paper • 2509.06155 • Published 26 days ago • 13
Motion2Motion: Cross-topology Motion Transfer with Sparse Correspondence Paper • 2508.13139 • Published Aug 18 • 4
Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer Paper • 2508.09131 • Published Aug 12 • 16
Motion2Motion: Cross-topology Motion Transfer with Sparse Correspondence Paper • 2508.13139 • Published Aug 18 • 4 • 2
LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs Paper • 2506.21862 • Published Jun 27 • 36
HumanMM: Global Human Motion Recovery from Multi-shot Videos Paper • 2503.07597 • Published Mar 10 • 2
HumanMM: Global Human Motion Recovery from Multi-shot Videos Paper • 2503.07597 • Published Mar 10 • 2 • 1
Article MotionLCM-V2: Improved Compression Rate for Multi-Latent-Token Diffusion By wxDai • Dec 11, 2024 • 17
TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video Paper • 2411.18671 • Published Nov 27, 2024 • 20
DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding Paper • 2411.14347 • Published Nov 21, 2024 • 15 • 3
MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms Paper • 2410.18977 • Published Oct 24, 2024 • 15