Diffusers
Safetensors
WanPipeline
BrianChen1129 commited on
Commit
8c044e9
·
verified ·
1 Parent(s): dada511

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +44 -0
README.md ADDED
@@ -0,0 +1,44 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+ ---
5
+ license: apache-2.0
6
+ ---
7
+
8
+ # FastVideo Wan2.1-VSA-T2V-14B-720P-Diffusers
9
+
10
+ <div>
11
+ <div align="center">
12
+ <a href="https://github.com/hao-ai-lab/FastVideo" target="_blank">FastVideo Team</a>&emsp;
13
+ </div>
14
+ <div align="center">
15
+ <a href="https://arxiv.org/pdf/2505.13389">Paper</a> |
16
+ <a href="https://github.com/hao-ai-lab/FastVideo">Code</a>
17
+ </div>
18
+ </div>
19
+
20
+
21
+ ## Model Overview
22
+ - This model is finetuned with [VSA](https://arxiv.org/pdf/2505.13389), based on [Wan-AI/Wan2.1-T2V-14B-Diffusers](https://huggingface.co/Wan-AI/Wan2.1-T2V-1.3B-Diffusers).
23
+ - It achieves up to 2.1x speed up on a single **H100** GPU.
24
+ - Both [finetuning](https://github.com/hao-ai-lab/FastVideo/blob/main/scripts/finetune/finetune_v1_VSA.sh) and [inference](https://github.com/hao-ai-lab/FastVideo/blob/main/scripts/inference/v1_inference_wan_VSA.sh) scripts are available in the [FastVideo](https://github.com/hao-ai-lab/FastVideo) repository.
25
+ - Try it out on **FastVideo** — we support a wide range of GPUs from **H100** to **4090**
26
+ - We use [FastVideo 720P Synthetic Wan dataset](https://huggingface.co/datasets/FastVideo/Wan-Syn_77x768x1280_250k) for training.
27
+
28
+
29
+
30
+ If you use Wan2.1-VSA-T2V-14B-720P-Diffusers model for your research, please cite our paper:
31
+ ```
32
+ @article{zhang2025vsa,
33
+ title={VSA: Faster Video Diffusion with Trainable Sparse Attention},
34
+ author={Zhang, Peiyuan and Huang, Haofeng and Chen, Yongqi and Lin, Will and Liu, Zhengzhong and Stoica, Ion and Xing, Eric and Zhang, Hao},
35
+ journal={arXiv preprint arXiv:2505.13389},
36
+ year={2025}
37
+ }
38
+ @article{zhang2025fast,
39
+ title={Fast video generation with sliding tile attention},
40
+ author={Zhang, Peiyuan and Chen, Yongqi and Su, Runlong and Ding, Hangliang and Stoica, Ion and Liu, Zhengzhong and Zhang, Hao},
41
+ journal={arXiv preprint arXiv:2502.04507},
42
+ year={2025}
43
+ }
44
+ ```