Update README.md
Browse files
README.md
CHANGED
|
@@ -8,14 +8,33 @@ base_model:
|
|
| 8 |
|
| 9 |
[](https://github.com/numz/ComfyUI-SeedVR2_VideoUpscaler)
|
| 10 |
|
| 11 |
-
A Non official custom nodes for ComfyUI that enables Upscale Video generation using [SeedVR2](https://github.com/ByteDance-Seed/SeedVR).
|
| 12 |
|
| 13 |
-
<
|
| 14 |
-
|
| 15 |
-
Your browser does not support the video tag.
|
| 16 |
-
</video>
|
| 17 |
|
| 18 |
-
<img src="docs/usage.png"
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 19 |
|
| 20 |
## Features
|
| 21 |
|
|
@@ -25,6 +44,7 @@ A Non official custom nodes for ComfyUI that enables Upscale Video generation us
|
|
| 25 |
|
| 26 |
## Requirements
|
| 27 |
|
|
|
|
| 28 |
- Last ComfyUI version with python 3.12.9 (may be works with older versions but I haven't test it)
|
| 29 |
|
| 30 |
## Installation
|
|
@@ -73,36 +93,56 @@ python_embeded\python.exe -m pip install -r flash_attn
|
|
| 73 |
|
| 74 |
<img src="docs/node.png" width="100%">
|
| 75 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 76 |
2. Configure the node parameters:
|
| 77 |
|
| 78 |
- `model`: Select your 3B or 7B model
|
| 79 |
- `seed`: a seed but it generate another seed from this one
|
| 80 |
- `new_width`: New desired Width, will keep ration on height
|
| 81 |
- `cfg_scale`:
|
| 82 |
-
- `batch_size`: VERY IMPORTANT!, this model consume a lot of VRAM, All your VRAM, even for the 3B model, so for GPU under 24GB VRAM keep this value Low, good
|
| 83 |
-
- `
|
| 84 |
|
| 85 |
## Performance
|
| 86 |
|
| 87 |
-
|
| 88 |
-
|
| 89 |
-
|
| 90 |
-
|
| 91 |
-
|
| 92 |
-
|
| 93 |
-
|
| 94 |
-
|
| 95 |
-
|
| 96 |
-
|
| 97 |
-
|
| 98 |
-
|
| 99 |
-
|
| 100 |
-
|
| 101 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 102 |
|
| 103 |
## Limitations
|
| 104 |
|
| 105 |
-
- Use a lot of VRAM, it will take
|
| 106 |
- Processing speed depends on GPU capabilities
|
| 107 |
|
| 108 |
## Credits
|
|
@@ -114,3 +154,4 @@ python_embeded\python.exe -m pip install -r flash_attn
|
|
| 114 |
- The code in this repository is released under the MIT license as found in the [LICENSE file](LICENSE).
|
| 115 |
|
| 116 |
|
|
|
|
|
|
| 8 |
|
| 9 |
[](https://github.com/numz/ComfyUI-SeedVR2_VideoUpscaler)
|
| 10 |
|
| 11 |
+
A Non official custom nodes for ComfyUI that enables Upscale Video/Images generation using [SeedVR2](https://github.com/ByteDance-Seed/SeedVR).
|
| 12 |
|
| 13 |
+
<img src="docs/demo_01.jpg">
|
| 14 |
+
<img src="docs/demo_02.jpg">
|
|
|
|
|
|
|
| 15 |
|
| 16 |
+
<img src="docs/usage.png">
|
| 17 |
+
|
| 18 |
+
## 🆙 Todo
|
| 19 |
+
|
| 20 |
+
- Fixed unloading the 3B model when the process is finished (sorry about that, I'm trying to find out what's going on)
|
| 21 |
+
|
| 22 |
+
## 🚀 Updates
|
| 23 |
+
|
| 24 |
+
**2025.06.24**
|
| 25 |
+
|
| 26 |
+
- 🚀 Speed up the process until x4 (see new benchmark)
|
| 27 |
+
|
| 28 |
+
**2025.06.22**
|
| 29 |
+
|
| 30 |
+
- 💪 FP8 compatibility !
|
| 31 |
+
- 🚀 Speed Up all Process
|
| 32 |
+
- 🚀 less VRAM consumption (Stay high, batch_size=1 for RTX4090 max, I'm trying to fix that)
|
| 33 |
+
- 🛠️ Better benchmark coming soon
|
| 34 |
+
|
| 35 |
+
**2025.06.20**
|
| 36 |
+
|
| 37 |
+
- 🛠️ Initial push
|
| 38 |
|
| 39 |
## Features
|
| 40 |
|
|
|
|
| 44 |
|
| 45 |
## Requirements
|
| 46 |
|
| 47 |
+
- A Huge VRAM capabilities is better, from my test, even the 3B version need a lot of VRAM at least 18GB.
|
| 48 |
- Last ComfyUI version with python 3.12.9 (may be works with older versions but I haven't test it)
|
| 49 |
|
| 50 |
## Installation
|
|
|
|
| 93 |
|
| 94 |
<img src="docs/node.png" width="100%">
|
| 95 |
|
| 96 |
+
2. things to know
|
| 97 |
+
|
| 98 |
+
**temporal consistency** : at least a batch_size of 5 is required to activate temporal consistency
|
| 99 |
+
|
| 100 |
2. Configure the node parameters:
|
| 101 |
|
| 102 |
- `model`: Select your 3B or 7B model
|
| 103 |
- `seed`: a seed but it generate another seed from this one
|
| 104 |
- `new_width`: New desired Width, will keep ration on height
|
| 105 |
- `cfg_scale`:
|
| 106 |
+
- `batch_size`: VERY IMPORTANT!, this model consume a lot of VRAM, All your VRAM, even for the 3B model, so for GPU under 24GB VRAM keep this value Low, good value is "1" without temporal consistency
|
| 107 |
+
- `preserve_vram`: for VRAM < 24GB, If true, It will unload unused models during process, longer but works, otherwise probably OOM with
|
| 108 |
|
| 109 |
## Performance
|
| 110 |
|
| 111 |
+
**NVIDIA H100 93GB VRAM** (values in parentheses are from the previous benchmark):
|
| 112 |
+
|
| 113 |
+
| Images | Resolution | Batch Size | Time fp8 (s) | FPS fp8 | Time fp16 (s) | FPS fp16 |
|
| 114 |
+
| ------ | ------------------- | ---------- | ---------------- | ----------- | ---------------- | ----------- |
|
| 115 |
+
| 3 | 512×768 → 1080×1620 | 1 | 10.18 (58.10) | 0.29 (0.05) | 10.67 (60.13) | 0.28 (0.05) |
|
| 116 |
+
| 15 | 512×768 → 1080×1620 | 5 | 26.71 (135.63) | 0.56 (0.11) | 27.75 (144.18) | 0.54 (0.10) |
|
| 117 |
+
| 27 | 512×768 → 1080×1620 | 9 | 33.97 (163.22) | 0.79 (0.17) | 35.08 (177.61) | 0.77 (0.15) |
|
| 118 |
+
| 39 | 512×768 → 1080×1620 | 13 | 41.01 (189.36) | 0.95 (0.21) | 42.08 (210.11) | 0.93 (0.19) |
|
| 119 |
+
| 51 | 512×768 → 1080×1620 | 17 | 48.12 (215.80) | 1.06 (0.24) | 49.44 (242.64) | 1.03 (0.21) |
|
| 120 |
+
| 63 | 512×768 → 1080×1620 | 21 | 55.40 (241.79) | 1.14 (0.26) | 56.70 (275.55) | 1.11 (0.23) |
|
| 121 |
+
| 75 | 512×768 → 1080×1620 | 25 | 62.60 (267.93) | 1.20 (0.28) | 63.80 (308.51) | 1.18 (0.24) |
|
| 122 |
+
| 123 | 512×768 → 1080×1620 | 41 | 91.38 (373.60) | 1.35 (0.33) | 92.90 (440.01) | 1.32 (0.28) |
|
| 123 |
+
| 243 | 512×768 → 1080×1620 | 81 | 164.25 (642.20) | 1.48 (0.38) | 166.09 (780.20) | 1.46 (0.31) |
|
| 124 |
+
| 363 | 512×768 → 1080×1620 | 121 | 238.18 (913.61) | 1.52 (0.40) | 239.80 (1114.32) | 1.51 (0.33) |
|
| 125 |
+
| 453 | 512×768 → 1080×1620 | 151 | 296.52 (1132.01) | 1.53 (0.40) | 298.65 (1384.86) | 1.52 (0.33) |
|
| 126 |
+
| 633 | 512×768 → 1080×1620 | 211 | 406.65 (1541.09) | 1.56 (0.41) | 409.44 (1887.62) | 1.55 (0.34) |
|
| 127 |
+
| 903 | 512×768 → 1080×1620 | 301 | OOM (OOM) | OOM (OOM) | OOM (OOM) | OOM (OOM) |
|
| 128 |
+
|
| 129 |
+
**NVIDIA RTX4090 24GB VRAM** (preserved_vram=off)
|
| 130 |
+
| Model | Images | Resolution | Batch Size | Time (seconds) | FPS | Note |
|
| 131 |
+
| ------------------------- | ------ | ------------------- | ---------- | -------------- | --- | --- |
|
| 132 |
+
| 3B fp8 | 5 | 512x768 → 1080x1620 | 1 | 22.52 | 0.22 | |
|
| 133 |
+
| 3B fp16 | 5 | 512x768 → 1080x1620 | 1 | 27.84 | 0.18 | |
|
| 134 |
+
| 7B fp8 | 5 | 512x768 → 1080x1620 | 1 | 75.51 | 0.07 | |
|
| 135 |
+
| 7B fp16 | 5 | 512x768 → 1080x1620 | 1 | 78.93 | 0.06 | |
|
| 136 |
+
| 3B fp8 | 10 | 512x768 → 1080x1620 | 5 | 39.75 | 0.15 | preserve_memory=on|
|
| 137 |
+
| 3B fp8 | 20 | 512x768 → 1080x1620 | 1 | 65.40 | 0.31 | |
|
| 138 |
+
| 3B fp16 | 20 | 512x768 → 1080x1620 | 1 | 91.12 | 0.22 | |
|
| 139 |
+
| 3B fp8 | 20 | 512x768 → 1280x1920 | 1 | 89.10 | 0.22 | |
|
| 140 |
+
| 3B fp8 | 20 | 512x768 → 1480x2220 | 1 | 136.08| 0.15 | |
|
| 141 |
+
| 3B fp8 | 20 | 512x768 → 1620x2430 | 1 | 191.28 | 0.10 | preserve_memory=on without GPU overload so longer 320sec |
|
| 142 |
|
| 143 |
## Limitations
|
| 144 |
|
| 145 |
+
- Use a lot of VRAM, it will take all!!
|
| 146 |
- Processing speed depends on GPU capabilities
|
| 147 |
|
| 148 |
## Credits
|
|
|
|
| 154 |
- The code in this repository is released under the MIT license as found in the [LICENSE file](LICENSE).
|
| 155 |
|
| 156 |
|
| 157 |
+
|