FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors Paper • 2501.08225 • Published 4 days ago • 17
MangaNinja: Line Art Colorization with Precise Reference Following Paper • 2501.08332 • Published 4 days ago • 48
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Paper • 2501.06186 • Published 8 days ago • 55
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Paper • 2501.03895 • Published 11 days ago • 48
Sana Collection ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 19 items • Updated 10 days ago • 85
Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control Paper • 2501.03847 • Published 11 days ago • 22
Cosmos World Foundation Model Platform for Physical AI Paper • 2501.03575 • Published 11 days ago • 63
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution Paper • 2501.02976 • Published 12 days ago • 51
TransPixar: Advancing Text-to-Video Generation with Transparency Paper • 2501.03006 • Published 12 days ago • 22
SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos Paper • 2412.09401 • Published Dec 12, 2024 • 2
Nested Attention: Semantic-aware Attention Values for Concept Personalization Paper • 2501.01407 • Published 16 days ago • 11
SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration Paper • 2501.01320 • Published 16 days ago • 11
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM Paper • 2501.00599 • Published 18 days ago • 41
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models Paper • 2501.01423 • Published 16 days ago • 36
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control Paper • 2501.01427 • Published 16 days ago • 49
From Elements to Design: A Layered Approach for Automatic Graphic Design Composition Paper • 2412.19712 • Published 22 days ago • 14
Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models Paper • 2412.18605 • Published 25 days ago • 20
SpotLight: Shadow-Guided Object Relighting via Diffusion Paper • 2411.18665 • Published Nov 27, 2024 • 3
MotiF: Making Text Count in Image Animation with Motion Focal Loss Paper • 2412.16153 • Published 29 days ago • 6