Shape-for-Motion: Precise and Consistent Video Editing with 3D Proxy Paper • 2506.22432 • Published 17 days ago • 12
Calligrapher: Freestyle Text Image Customization Paper • 2506.24123 • Published 14 days ago • 31
Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation Paper • 2506.19852 • Published 20 days ago • 38
FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model Paper • 2507.01953 • Published 12 days ago • 19
Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation Paper • 2507.01957 • Published 12 days ago • 18
Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights Paper • 2506.16406 • Published 25 days ago • 119
Shifting AI Efficiency From Model-Centric to Data-Centric Compression Paper • 2505.19147 • Published May 25 • 146
Alchemist: Turning Public Text-to-Image Data into Generative Gold Paper • 2505.19297 • Published May 25 • 81
Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression Paper • 2505.19602 • Published May 26 • 13
Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps Paper • 2505.18675 • Published May 24 • 23
VeriThinker: Learning to Verify Makes Reasoning Model Efficient Paper • 2505.17941 • Published May 23 • 25
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL Paper • 2504.11455 • Published Apr 15 • 14
AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset Paper • 2503.19462 • Published Mar 25 • 10
Personalize Anything for Free with Diffusion Transformer Paper • 2503.12590 • Published Mar 16 • 44
FlowTok: Flowing Seamlessly Across Text and Image Tokens Paper • 2503.10772 • Published Mar 13 • 19