image edit CoLLM: A Large Language Model for Composed Image Retrieval Paper • 2503.19910 • Published 2 days ago • 11
CoLLM: A Large Language Model for Composed Image Retrieval Paper • 2503.19910 • Published 2 days ago • 11
t2v DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation Paper • 2412.18597 • Published Dec 24, 2024 • 19
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation Paper • 2412.18597 • Published Dec 24, 2024 • 19