🌞 May 2025 - Open works from the Chinese community Collection 43 items • Updated 1 day ago • 8
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation Paper • 2506.03147 • Published 3 days ago • 55
MAGREF: Masked Guidance for Any-Reference Video Generation Paper • 2505.23742 • Published 8 days ago • 9
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data Paper • 2505.18445 • Published 14 days ago • 63
SageAttention2++: A More Efficient Implementation of SageAttention2 Paper • 2505.21136 • Published 10 days ago • 43
Sci-Fi: Symmetric Constraint for Frame Inbetweening Paper • 2505.21205 • Published 10 days ago • 5
ImgEdit: A Unified Image Editing Dataset and Benchmark Paper • 2505.20275 • Published 11 days ago • 17
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation Paper • 2505.20292 • Published 11 days ago • 52
Model Merging in Pre-training of Large Language Models Paper • 2505.12082 • Published 20 days ago • 35
QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design Paper • 2505.16175 • Published 16 days ago • 39
Scaling Diffusion Transformers Efficiently via μP Paper • 2505.15270 • Published 17 days ago • 32
Training-Free Efficient Video Generation via Dynamic Token Carving Paper • 2505.16864 • Published 15 days ago • 21
OpenS2V-Nexus Collection OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation • 5 items • Updated 10 days ago • 3
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation Paper • 2505.04512 • Published about 1 month ago • 35
Step1X-Edit: A Practical Framework for General Image Editing Paper • 2504.17761 • Published Apr 24 • 88
HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation Paper • 2407.17438 • Published Jul 24, 2024 • 26
Personalized Text-to-Image Generation with Auto-Regressive Models Paper • 2504.13162 • Published Apr 17 • 19
DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning Paper • 2504.14509 • Published Apr 20 • 51
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework Paper • 2504.12395 • Published Apr 16 • 17