ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction Paper • 2412.12888 • Published Dec 17, 2024
EliGen: Entity-Level Controlled Image Generation with Regional Attention Paper • 2501.01097 • Published Jan 2
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs Paper • 2504.17432 • Published 14 days ago • 38
StyleBooth: Image Style Editing with Multimodal Instruction Paper • 2404.12154 • Published Apr 18, 2024
SWIFT:A Scalable lightWeight Infrastructure for Fine-Tuning Paper • 2408.05517 • Published Aug 10, 2024 • 2
ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling Paper • 2501.02487 • Published Jan 5
Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published Mar 26 • 51
Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published Mar 26 • 51
Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published Mar 26 • 51
ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling Paper • 2501.02487 • Published Jan 5