Update README.md
Browse files
README.md
CHANGED
@@ -8,14 +8,33 @@ base_model:
|
|
8 |
|
9 |
[](https://github.com/numz/ComfyUI-SeedVR2_VideoUpscaler)
|
10 |
|
11 |
-
A Non official custom nodes for ComfyUI that enables Upscale Video generation using [SeedVR2](https://github.com/ByteDance-Seed/SeedVR).
|
12 |
|
13 |
-
<
|
14 |
-
|
15 |
-
Your browser does not support the video tag.
|
16 |
-
</video>
|
17 |
|
18 |
-
<img src="docs/usage.png"
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
19 |
|
20 |
## Features
|
21 |
|
@@ -25,6 +44,7 @@ A Non official custom nodes for ComfyUI that enables Upscale Video generation us
|
|
25 |
|
26 |
## Requirements
|
27 |
|
|
|
28 |
- Last ComfyUI version with python 3.12.9 (may be works with older versions but I haven't test it)
|
29 |
|
30 |
## Installation
|
@@ -73,36 +93,56 @@ python_embeded\python.exe -m pip install -r flash_attn
|
|
73 |
|
74 |
<img src="docs/node.png" width="100%">
|
75 |
|
|
|
|
|
|
|
|
|
76 |
2. Configure the node parameters:
|
77 |
|
78 |
- `model`: Select your 3B or 7B model
|
79 |
- `seed`: a seed but it generate another seed from this one
|
80 |
- `new_width`: New desired Width, will keep ration on height
|
81 |
- `cfg_scale`:
|
82 |
-
- `batch_size`: VERY IMPORTANT!, this model consume a lot of VRAM, All your VRAM, even for the 3B model, so for GPU under 24GB VRAM keep this value Low, good
|
83 |
-
- `
|
84 |
|
85 |
## Performance
|
86 |
|
87 |
-
|
88 |
-
|
89 |
-
|
90 |
-
|
91 |
-
|
92 |
-
|
93 |
-
|
94 |
-
|
95 |
-
|
96 |
-
|
97 |
-
|
98 |
-
|
99 |
-
|
100 |
-
|
101 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
102 |
|
103 |
## Limitations
|
104 |
|
105 |
-
- Use a lot of VRAM, it will take
|
106 |
- Processing speed depends on GPU capabilities
|
107 |
|
108 |
## Credits
|
@@ -114,3 +154,4 @@ python_embeded\python.exe -m pip install -r flash_attn
|
|
114 |
- The code in this repository is released under the MIT license as found in the [LICENSE file](LICENSE).
|
115 |
|
116 |
|
|
|
|
8 |
|
9 |
[](https://github.com/numz/ComfyUI-SeedVR2_VideoUpscaler)
|
10 |
|
11 |
+
A Non official custom nodes for ComfyUI that enables Upscale Video/Images generation using [SeedVR2](https://github.com/ByteDance-Seed/SeedVR).
|
12 |
|
13 |
+
<img src="docs/demo_01.jpg">
|
14 |
+
<img src="docs/demo_02.jpg">
|
|
|
|
|
15 |
|
16 |
+
<img src="docs/usage.png">
|
17 |
+
|
18 |
+
## π Todo
|
19 |
+
|
20 |
+
- Fixed unloading the 3B model when the process is finished (sorry about that, I'm trying to find out what's going on)
|
21 |
+
|
22 |
+
## π Updates
|
23 |
+
|
24 |
+
**2025.06.24**
|
25 |
+
|
26 |
+
- π Speed up the process until x4 (see new benchmark)
|
27 |
+
|
28 |
+
**2025.06.22**
|
29 |
+
|
30 |
+
- πͺ FP8 compatibility !
|
31 |
+
- π Speed Up all Process
|
32 |
+
- π less VRAM consumption (Stay high, batch_size=1 for RTX4090 max, I'm trying to fix that)
|
33 |
+
- π οΈ Better benchmark coming soon
|
34 |
+
|
35 |
+
**2025.06.20**
|
36 |
+
|
37 |
+
- π οΈ Initial push
|
38 |
|
39 |
## Features
|
40 |
|
|
|
44 |
|
45 |
## Requirements
|
46 |
|
47 |
+
- A Huge VRAM capabilities is better, from my test, even the 3B version need a lot of VRAM at least 18GB.
|
48 |
- Last ComfyUI version with python 3.12.9 (may be works with older versions but I haven't test it)
|
49 |
|
50 |
## Installation
|
|
|
93 |
|
94 |
<img src="docs/node.png" width="100%">
|
95 |
|
96 |
+
2. things to know
|
97 |
+
|
98 |
+
**temporal consistency** : at least a batch_size of 5 is required to activate temporal consistency
|
99 |
+
|
100 |
2. Configure the node parameters:
|
101 |
|
102 |
- `model`: Select your 3B or 7B model
|
103 |
- `seed`: a seed but it generate another seed from this one
|
104 |
- `new_width`: New desired Width, will keep ration on height
|
105 |
- `cfg_scale`:
|
106 |
+
- `batch_size`: VERY IMPORTANT!, this model consume a lot of VRAM, All your VRAM, even for the 3B model, so for GPU under 24GB VRAM keep this value Low, good value is "1" without temporal consistency
|
107 |
+
- `preserve_vram`: for VRAM < 24GB, If true, It will unload unused models during process, longer but works, otherwise probably OOM with
|
108 |
|
109 |
## Performance
|
110 |
|
111 |
+
**NVIDIA H100 93GB VRAM** (values in parentheses are from the previous benchmark):
|
112 |
+
|
113 |
+
| Images | Resolution | Batch Size | Time fp8 (s) | FPS fp8 | Time fp16 (s) | FPS fp16 |
|
114 |
+
| ------ | ------------------- | ---------- | ---------------- | ----------- | ---------------- | ----------- |
|
115 |
+
| 3 | 512Γ768 β 1080Γ1620 | 1 | 10.18 (58.10) | 0.29 (0.05) | 10.67 (60.13) | 0.28 (0.05) |
|
116 |
+
| 15 | 512Γ768 β 1080Γ1620 | 5 | 26.71 (135.63) | 0.56 (0.11) | 27.75 (144.18) | 0.54 (0.10) |
|
117 |
+
| 27 | 512Γ768 β 1080Γ1620 | 9 | 33.97 (163.22) | 0.79 (0.17) | 35.08 (177.61) | 0.77 (0.15) |
|
118 |
+
| 39 | 512Γ768 β 1080Γ1620 | 13 | 41.01 (189.36) | 0.95 (0.21) | 42.08 (210.11) | 0.93 (0.19) |
|
119 |
+
| 51 | 512Γ768 β 1080Γ1620 | 17 | 48.12 (215.80) | 1.06 (0.24) | 49.44 (242.64) | 1.03 (0.21) |
|
120 |
+
| 63 | 512Γ768 β 1080Γ1620 | 21 | 55.40 (241.79) | 1.14 (0.26) | 56.70 (275.55) | 1.11 (0.23) |
|
121 |
+
| 75 | 512Γ768 β 1080Γ1620 | 25 | 62.60 (267.93) | 1.20 (0.28) | 63.80 (308.51) | 1.18 (0.24) |
|
122 |
+
| 123 | 512Γ768 β 1080Γ1620 | 41 | 91.38 (373.60) | 1.35 (0.33) | 92.90 (440.01) | 1.32 (0.28) |
|
123 |
+
| 243 | 512Γ768 β 1080Γ1620 | 81 | 164.25 (642.20) | 1.48 (0.38) | 166.09 (780.20) | 1.46 (0.31) |
|
124 |
+
| 363 | 512Γ768 β 1080Γ1620 | 121 | 238.18 (913.61) | 1.52 (0.40) | 239.80 (1114.32) | 1.51 (0.33) |
|
125 |
+
| 453 | 512Γ768 β 1080Γ1620 | 151 | 296.52 (1132.01) | 1.53 (0.40) | 298.65 (1384.86) | 1.52 (0.33) |
|
126 |
+
| 633 | 512Γ768 β 1080Γ1620 | 211 | 406.65 (1541.09) | 1.56 (0.41) | 409.44 (1887.62) | 1.55 (0.34) |
|
127 |
+
| 903 | 512Γ768 β 1080Γ1620 | 301 | OOM (OOM) | OOM (OOM) | OOM (OOM) | OOM (OOM) |
|
128 |
+
|
129 |
+
**NVIDIA RTX4090 24GB VRAM** (preserved_vram=off)
|
130 |
+
| Model | Images | Resolution | Batch Size | Time (seconds) | FPS | Note |
|
131 |
+
| ------------------------- | ------ | ------------------- | ---------- | -------------- | --- | --- |
|
132 |
+
| 3B fp8 | 5 | 512x768 β 1080x1620 | 1 | 22.52 | 0.22 | |
|
133 |
+
| 3B fp16 | 5 | 512x768 β 1080x1620 | 1 | 27.84 | 0.18 | |
|
134 |
+
| 7B fp8 | 5 | 512x768 β 1080x1620 | 1 | 75.51 | 0.07 | |
|
135 |
+
| 7B fp16 | 5 | 512x768 β 1080x1620 | 1 | 78.93 | 0.06 | |
|
136 |
+
| 3B fp8 | 10 | 512x768 β 1080x1620 | 5 | 39.75 | 0.15 | preserve_memory=on|
|
137 |
+
| 3B fp8 | 20 | 512x768 β 1080x1620 | 1 | 65.40 | 0.31 | |
|
138 |
+
| 3B fp16 | 20 | 512x768 β 1080x1620 | 1 | 91.12 | 0.22 | |
|
139 |
+
| 3B fp8 | 20 | 512x768 β 1280x1920 | 1 | 89.10 | 0.22 | |
|
140 |
+
| 3B fp8 | 20 | 512x768 β 1480x2220 | 1 | 136.08| 0.15 | |
|
141 |
+
| 3B fp8 | 20 | 512x768 β 1620x2430 | 1 | 191.28 | 0.10 | preserve_memory=on without GPU overload so longer 320sec |
|
142 |
|
143 |
## Limitations
|
144 |
|
145 |
+
- Use a lot of VRAM, it will take all!!
|
146 |
- Processing speed depends on GPU capabilities
|
147 |
|
148 |
## Credits
|
|
|
154 |
- The code in this repository is released under the MIT license as found in the [LICENSE file](LICENSE).
|
155 |
|
156 |
|
157 |
+
|