HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives
Abstract
HoloCine generates coherent multi-shot narratives using a Window Cross-Attention mechanism and Sparse Inter-Shot Self-Attention, enabling end-to-end cinematic creation.
State-of-the-art text-to-video models excel at generating isolated clips but fall short of creating the coherent, multi-shot narratives that are the essence of storytelling. We bridge this "narrative gap" with HoloCine, a model that generates entire scenes holistically to ensure global consistency from the first shot to the last. Our architecture achieves precise directorial control through a Window Cross-Attention mechanism that localizes text prompts to specific shots, while a Sparse Inter-Shot Self-Attention pattern (dense within shots but sparse between them) ensures the efficiency required for minute-scale generation. Beyond setting a new state-of-the-art in narrative coherence, HoloCine exhibits remarkable emergent abilities: a persistent memory for characters and scenes, and an intuitive grasp of cinematic techniques. Our work marks a pivotal shift from clip synthesis towards automated filmmaking, making end-to-end cinematic creation a tangible future. Our code is available at: https://holo-cine.github.io/.
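The abstract only names the two attention mechanisms, so below is a minimal PyTorch sketch of one plausible instantiation of their masks, not the authors' implementation. The shot lengths, per-shot prompt lengths, the number of cross-shot "anchor" tokens, and the optional shared scene prompt are all illustrative assumptions.

```python
# Minimal sketch (illustrative, not the HoloCine implementation) of the two
# attention masks described in the abstract: sparse inter-shot self-attention
# and window cross-attention. Shapes and parameters are assumed for clarity.
import torch


def sparse_inter_shot_self_attn_mask(shot_lengths, anchors_per_shot=4):
    """Boolean self-attention mask over all video tokens of a scene.

    Dense within each shot; across shots, queries may only attend to the first
    `anchors_per_shot` tokens of every other shot (one possible sparse pattern).
    True = attention allowed.
    """
    total = sum(shot_lengths)
    mask = torch.zeros(total, total, dtype=torch.bool)
    starts = [0]
    for length in shot_lengths:
        starts.append(starts[-1] + length)
    for i, li in enumerate(shot_lengths):
        q = slice(starts[i], starts[i] + li)
        mask[q, q] = True  # dense attention within the shot
        for j, lj in enumerate(shot_lengths):
            if i == j:
                continue
            k = slice(starts[j], starts[j] + min(anchors_per_shot, lj))
            mask[q, k] = True  # sparse links to a few anchor tokens per other shot
    return mask


def window_cross_attn_mask(shot_lengths, prompt_lengths, global_prompt_len=0):
    """Boolean cross-attention mask from video tokens to text tokens.

    Each shot's video tokens see only that shot's prompt tokens, plus an
    optional global scene prompt shared by all shots. True = attention allowed.
    """
    assert len(shot_lengths) == len(prompt_lengths)
    n_video = sum(shot_lengths)
    n_text = global_prompt_len + sum(prompt_lengths)
    mask = torch.zeros(n_video, n_text, dtype=torch.bool)
    mask[:, :global_prompt_len] = True  # every shot may read the shared scene prompt
    v, t = 0, global_prompt_len
    for lv, lt in zip(shot_lengths, prompt_lengths):
        mask[v:v + lv, t:t + lt] = True  # this shot's tokens see only its own prompt
        v += lv
        t += lt
    return mask


# Example: a three-shot scene with 6, 4, and 5 video tokens and 7 text tokens per shot.
self_mask = sparse_inter_shot_self_attn_mask([6, 4, 5], anchors_per_shot=2)
cross_mask = window_cross_attn_mask([6, 4, 5], [7, 7, 7], global_prompt_len=10)
print(self_mask.shape, cross_mask.shape)  # torch.Size([15, 15]) torch.Size([15, 31])
```

Such boolean masks could, for instance, be passed as `attn_mask` to `torch.nn.functional.scaled_dot_product_attention`, where `True` marks positions that are allowed to attend.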
Community
HoloCine is a text-to-video framework that holistically generates coherent, cinematic multi-shot video narratives from a single prompt, combining Window Cross-Attention for per-shot control and Sparse Inter-Shot Self-Attention for efficient, consistent long-scene generation.
Thanks a lot @taesiri for helping us submit our paper to the daily papers!
Could we please use the following video as the cover to better showcase our results?
https://holo-cine.github.io/holocine.mp4
Congrats on the amazing work!
Unfortunately, it seems the media tag cannot be updated after an initial submission has been made. I think I messed that up, super sorry about that.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API:
- MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation (2025)
- Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis (2025)
- CharCom: Composable Identity Control for Multi-Character Story Illustration (2025)
- AudioStory: Generating Long-Form Narrative Audio with Large Language Models (2025)
- LongLive: Real-time Interactive Long Video Generation (2025)
- Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset (2025)
- TalkCuts: A Large-Scale Dataset for Multi-Shot Human Speech Video Generation (2025)