FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model Paper • 2507.01953 • Published Jul 2 • 18
LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation Paper • 2508.03694 • Published Aug 5 • 51
SpineBench: A Clinically Salient, Level-Aware Benchmark Powered by the SpineMed-450k Corpus Paper • 2510.03160 • Published Oct 3 • 4
DiverseAR: Boosting Diversity in Bitwise Autoregressive Image Generation Paper • 2512.02931 • Published 28 days ago
LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published 15 days ago • 72
PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design Paper • 2512.04082 • Published 27 days ago • 13
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published Nov 14 • 164
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation Paper • 2510.08673 • Published Oct 9 • 125
Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding Paper • 2507.15028 • Published Jul 20 • 21
VBench: Comprehensive Benchmark Suite for Video Generative Models Paper • 2311.17982 • Published Nov 29, 2023 • 9