SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL Paper • 2504.11455 • Published Apr 15 • 14
Running on Zero 321 321 OminiControl Art 🎨 Transform images into artistic styles like Studio Ghibli
AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset Paper • 2503.19462 • Published Mar 25 • 10
Personalize Anything for Free with Diffusion Transformer Paper • 2503.12590 • Published Mar 16 • 44
FlowTok: Flowing Seamlessly Across Text and Image Tokens Paper • 2503.10772 • Published Mar 13 • 19
Kolmogorov-Arnold Attention: Is Learnable Attention Better For Vision Transformers? Paper • 2503.10632 • Published Mar 13 • 14
PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity Paper • 2503.07677 • Published Mar 10 • 85
Automated Movie Generation via Multi-Agent CoT Planning Paper • 2503.07314 • Published Mar 10 • 45
ObjectMover: Generative Object Movement with Video Prior Paper • 2503.08037 • Published Mar 11 • 4