numz commited on
Commit
c63245c
Β·
verified Β·
1 Parent(s): fe3f6de

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +65 -24
README.md CHANGED
@@ -8,14 +8,33 @@ base_model:
8
 
9
  [![View Code](https://img.shields.io/badge/πŸ“‚_View_Code-GitHub-181717?style=for-the-badge&logo=github)](https://github.com/numz/ComfyUI-SeedVR2_VideoUpscaler)
10
 
11
- A Non official custom nodes for ComfyUI that enables Upscale Video generation using [SeedVR2](https://github.com/ByteDance-Seed/SeedVR).
12
 
13
- <video width="700px" controls>
14
- <source src="https://github.com/user-attachments/assets/8fbd6c1f-4246-4dbe-8819-4e684490c5f2" type="video/mp4">
15
- Your browser does not support the video tag.
16
- </video>
17
 
18
- <img src="docs/usage.png" width="700px">
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
19
 
20
  ## Features
21
 
@@ -25,6 +44,7 @@ A Non official custom nodes for ComfyUI that enables Upscale Video generation us
25
 
26
  ## Requirements
27
 
 
28
  - Last ComfyUI version with python 3.12.9 (may be works with older versions but I haven't test it)
29
 
30
  ## Installation
@@ -73,36 +93,56 @@ python_embeded\python.exe -m pip install -r flash_attn
73
 
74
  <img src="docs/node.png" width="100%">
75
 
 
 
 
 
76
  2. Configure the node parameters:
77
 
78
  - `model`: Select your 3B or 7B model
79
  - `seed`: a seed but it generate another seed from this one
80
  - `new_width`: New desired Width, will keep ration on height
81
  - `cfg_scale`:
82
- - `batch_size`: VERY IMPORTANT!, this model consume a lot of VRAM, All your VRAM, even for the 3B model, so for GPU under 24GB VRAM keep this value Low, good values are [1,5,9,13,...]
83
- - `vram_mode`: It will try to help with VRAM, but 'auto' is good
84
 
85
  ## Performance
86
 
87
- 1. **NVIDIA H100 93GB VRAM**
88
-
89
- 3B or 7B spike to 90+GB VRAM!! but fast!!
90
-
91
- - 3B Model, 97 images, from 512x768 to 1280x1920, batch_size=50 => Prompt executed in 338.63 seconds
92
- - 3B Model, 97 images, from 512x768 to 1280x1920, batch_size=10 => Prompt executed in 540.22 seconds
93
- - 3B Model, 97 images, from 512x768 to 720x1080, batch_size=10 => Prompt executed in 183.64 seconds
94
- - 7B Model, 50 images, 512x768 to 1080x1620, batch_size=50, Prompt executed in 166.89 seconds
95
- - 7B Model, 97 images, 512x768 to 1080x1620, batch_size=97, Prompt executed in 146.72 seconds
96
- - 7B Model, 200 images, 512x768 to 1080x1620, batch_size=200, Prompt executed in 266.14 seconds
97
-
98
- 2. **NVIDIA RTX4090 24GB VRAM**
99
-
100
- - 3B Model, 20 images, from 512x768 to 1080x1620, batch_size=1, Prompt executed in 1022.26 seconds
101
- -
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
102
 
103
  ## Limitations
104
 
105
- - Use a lot of VRAM, it will take alllllll!!!!
106
  - Processing speed depends on GPU capabilities
107
 
108
  ## Credits
@@ -114,3 +154,4 @@ python_embeded\python.exe -m pip install -r flash_attn
114
  - The code in this repository is released under the MIT license as found in the [LICENSE file](LICENSE).
115
 
116
 
 
 
8
 
9
  [![View Code](https://img.shields.io/badge/πŸ“‚_View_Code-GitHub-181717?style=for-the-badge&logo=github)](https://github.com/numz/ComfyUI-SeedVR2_VideoUpscaler)
10
 
11
+ A Non official custom nodes for ComfyUI that enables Upscale Video/Images generation using [SeedVR2](https://github.com/ByteDance-Seed/SeedVR).
12
 
13
+ <img src="docs/demo_01.jpg">
14
+ <img src="docs/demo_02.jpg">
 
 
15
 
16
+ <img src="docs/usage.png">
17
+
18
+ ## πŸ†™ Todo
19
+
20
+ - Fixed unloading the 3B model when the process is finished (sorry about that, I'm trying to find out what's going on)
21
+
22
+ ## πŸš€ Updates
23
+
24
+ **2025.06.24**
25
+
26
+ - πŸš€ Speed up the process until x4 (see new benchmark)
27
+
28
+ **2025.06.22**
29
+
30
+ - πŸ’ͺ FP8 compatibility !
31
+ - πŸš€ Speed Up all Process
32
+ - πŸš€ less VRAM consumption (Stay high, batch_size=1 for RTX4090 max, I'm trying to fix that)
33
+ - πŸ› οΈ Better benchmark coming soon
34
+
35
+ **2025.06.20**
36
+
37
+ - πŸ› οΈ Initial push
38
 
39
  ## Features
40
 
 
44
 
45
  ## Requirements
46
 
47
+ - A Huge VRAM capabilities is better, from my test, even the 3B version need a lot of VRAM at least 18GB.
48
  - Last ComfyUI version with python 3.12.9 (may be works with older versions but I haven't test it)
49
 
50
  ## Installation
 
93
 
94
  <img src="docs/node.png" width="100%">
95
 
96
+ 2. things to know
97
+
98
+ **temporal consistency** : at least a batch_size of 5 is required to activate temporal consistency
99
+
100
  2. Configure the node parameters:
101
 
102
  - `model`: Select your 3B or 7B model
103
  - `seed`: a seed but it generate another seed from this one
104
  - `new_width`: New desired Width, will keep ration on height
105
  - `cfg_scale`:
106
+ - `batch_size`: VERY IMPORTANT!, this model consume a lot of VRAM, All your VRAM, even for the 3B model, so for GPU under 24GB VRAM keep this value Low, good value is "1" without temporal consistency
107
+ - `preserve_vram`: for VRAM < 24GB, If true, It will unload unused models during process, longer but works, otherwise probably OOM with
108
 
109
  ## Performance
110
 
111
+ **NVIDIA H100 93GB VRAM** (values in parentheses are from the previous benchmark):
112
+
113
+ | Images | Resolution | Batch Size | Time fp8 (s) | FPS fp8 | Time fp16 (s) | FPS fp16 |
114
+ | ------ | ------------------- | ---------- | ---------------- | ----------- | ---------------- | ----------- |
115
+ | 3 | 512Γ—768 β†’ 1080Γ—1620 | 1 | 10.18 (58.10) | 0.29 (0.05) | 10.67 (60.13) | 0.28 (0.05) |
116
+ | 15 | 512Γ—768 β†’ 1080Γ—1620 | 5 | 26.71 (135.63) | 0.56 (0.11) | 27.75 (144.18) | 0.54 (0.10) |
117
+ | 27 | 512Γ—768 β†’ 1080Γ—1620 | 9 | 33.97 (163.22) | 0.79 (0.17) | 35.08 (177.61) | 0.77 (0.15) |
118
+ | 39 | 512Γ—768 β†’ 1080Γ—1620 | 13 | 41.01 (189.36) | 0.95 (0.21) | 42.08 (210.11) | 0.93 (0.19) |
119
+ | 51 | 512Γ—768 β†’ 1080Γ—1620 | 17 | 48.12 (215.80) | 1.06 (0.24) | 49.44 (242.64) | 1.03 (0.21) |
120
+ | 63 | 512Γ—768 β†’ 1080Γ—1620 | 21 | 55.40 (241.79) | 1.14 (0.26) | 56.70 (275.55) | 1.11 (0.23) |
121
+ | 75 | 512Γ—768 β†’ 1080Γ—1620 | 25 | 62.60 (267.93) | 1.20 (0.28) | 63.80 (308.51) | 1.18 (0.24) |
122
+ | 123 | 512Γ—768 β†’ 1080Γ—1620 | 41 | 91.38 (373.60) | 1.35 (0.33) | 92.90 (440.01) | 1.32 (0.28) |
123
+ | 243 | 512Γ—768 β†’ 1080Γ—1620 | 81 | 164.25 (642.20) | 1.48 (0.38) | 166.09 (780.20) | 1.46 (0.31) |
124
+ | 363 | 512Γ—768 β†’ 1080Γ—1620 | 121 | 238.18 (913.61) | 1.52 (0.40) | 239.80 (1114.32) | 1.51 (0.33) |
125
+ | 453 | 512Γ—768 β†’ 1080Γ—1620 | 151 | 296.52 (1132.01) | 1.53 (0.40) | 298.65 (1384.86) | 1.52 (0.33) |
126
+ | 633 | 512Γ—768 β†’ 1080Γ—1620 | 211 | 406.65 (1541.09) | 1.56 (0.41) | 409.44 (1887.62) | 1.55 (0.34) |
127
+ | 903 | 512Γ—768 β†’ 1080Γ—1620 | 301 | OOM (OOM) | OOM (OOM) | OOM (OOM) | OOM (OOM) |
128
+
129
+ **NVIDIA RTX4090 24GB VRAM** (preserved_vram=off)
130
+ | Model | Images | Resolution | Batch Size | Time (seconds) | FPS | Note |
131
+ | ------------------------- | ------ | ------------------- | ---------- | -------------- | --- | --- |
132
+ | 3B fp8 | 5 | 512x768 β†’ 1080x1620 | 1 | 22.52 | 0.22 | |
133
+ | 3B fp16 | 5 | 512x768 β†’ 1080x1620 | 1 | 27.84 | 0.18 | |
134
+ | 7B fp8 | 5 | 512x768 β†’ 1080x1620 | 1 | 75.51 | 0.07 | |
135
+ | 7B fp16 | 5 | 512x768 β†’ 1080x1620 | 1 | 78.93 | 0.06 | |
136
+ | 3B fp8 | 10 | 512x768 β†’ 1080x1620 | 5 | 39.75 | 0.15 | preserve_memory=on|
137
+ | 3B fp8 | 20 | 512x768 β†’ 1080x1620 | 1 | 65.40 | 0.31 | |
138
+ | 3B fp16 | 20 | 512x768 β†’ 1080x1620 | 1 | 91.12 | 0.22 | |
139
+ | 3B fp8 | 20 | 512x768 β†’ 1280x1920 | 1 | 89.10 | 0.22 | |
140
+ | 3B fp8 | 20 | 512x768 β†’ 1480x2220 | 1 | 136.08| 0.15 | |
141
+ | 3B fp8 | 20 | 512x768 β†’ 1620x2430 | 1 | 191.28 | 0.10 | preserve_memory=on without GPU overload so longer 320sec |
142
 
143
  ## Limitations
144
 
145
+ - Use a lot of VRAM, it will take all!!
146
  - Processing speed depends on GPU capabilities
147
 
148
  ## Credits
 
154
  - The code in this repository is released under the MIT license as found in the [LICENSE file](LICENSE).
155
 
156
 
157
+