Packing Input Frame Context in Next-Frame Prediction Models for Video Generation Paper • 2504.12626 • Published 4 days ago • 43
Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Paper • 2504.07866 • Published 11 days ago • 8
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention Paper • 2504.06261 • Published 13 days ago • 101
Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation Paper • 2503.24379 • Published 21 days ago • 74
AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction Paper • 2504.01014 • Published 20 days ago • 62
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 14 days ago • 164
Less-to-More Generalization: Unlocking More Controllability by In-Context Generation Paper • 2504.02160 • Published 19 days ago • 33
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 13 days ago • 146
Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published 26 days ago • 49
YuE: Scaling Open Foundation Models for Long-Form Music Generation Paper • 2503.08638 • Published Mar 11 • 62
SketchVideo: Sketch-based Video Generation and Editing Paper • 2503.23284 • Published 22 days ago • 22
When Less is Enough: Adaptive Token Reduction for Efficient Image Representation Paper • 2503.16660 • Published Mar 20 • 73
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation Paper • 2503.13358 • Published Mar 17 • 96
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation Paper • 2503.13358 • Published Mar 17 • 96