Packing Input Frame Context in Next-Frame Prediction Models for Video Generation Paper • 2504.12626 • Published Apr 17 • 51
You Only Cache Once: Decoder-Decoder Architectures for Language Models Paper • 2405.05254 • Published May 8, 2024 • 10
Training-free Long Video Generation with Chain of Diffusion Model Experts Paper • 2408.13423 • Published Aug 24, 2024 • 24
stabilityai/stable-diffusion-3-medium-diffusers Text-to-Image • Updated Jun 19, 2024 • 202k • 400
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets Paper • 2311.15127 • Published Nov 25, 2023 • 13