Update README.md
Browse files
README.md
CHANGED
@@ -6,76 +6,109 @@ base_model:
|
|
6 |
pipeline_tag: image-to-image
|
7 |
---
|
8 |
|
|
|
9 |
|
10 |
-
# HiDream Img2Img ComfyUI Workflow
|
11 |
|
12 |
-
|
|
|
|
|
13 |
|
14 |
-
|
|
|
15 |
|
16 |
-
## Overview
|
17 |
-
- **Image-to-image generation** with HiDream diffusion model
|
18 |
-
- **Florence-2** for prompt generation and captioning
|
19 |
-
- **VAE encoding/decoding** and advanced CLIP-based text encoding
|
20 |
-
- **Negative prompt** support for artifact reduction
|
21 |
-
- **LOW VRAM MODE**
|
22 |
|
23 |
-
|
24 |
|
25 |
-
|
26 |
-
- **
|
27 |
-
|
28 |
-
|
29 |
-
|
30 |
|
31 |
-
##
|
32 |
|
33 |
-
|
34 |
-
|
35 |
-
Place all in: ComfyUI/models/unet
|
36 |
|
37 |
-
|
38 |
-
Place all in: `ComfyUI/models/text_encoders`
|
39 |
-
- **clip_g_hidream.safetensors**
|
40 |
-
[Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/text_encoders/clip_g_hidream.safetensors)
|
41 |
-
- **clip_l_hidream.safetensors**
|
42 |
-
[Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/text_encoders/clip_l_hidream.safetensors)
|
43 |
-
- **llama_3.1_8b_instruct_fp8_scaled.safetensors**
|
44 |
-
[Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/text_encoders/llama_3.1_8b_instruct_fp8_scaled.safetensors)
|
45 |
-
- **t5xxl_fp8_e4m3fn_scaled.safetensors**
|
46 |
-
[Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/text_encoders/t5xxl_fp8_e4m3fn_scaled.safetensors)
|
47 |
|
48 |
-
###
|
49 |
-
|
50 |
-
|
51 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
52 |
|
53 |
-
### Florence-2 Prompt Generator (NO need to download - will be downloaded automatacally at runtime)
|
54 |
- **Florence-2-large**
|
55 |
-
|
56 |
-
|
|
|
|
|
57 |
|
58 |
-
## Usage
|
59 |
-
1. Download all required models and place them in the correct directories as listed above.
|
60 |
-
2. Drag the workflow image in ComfyUI
|
61 |
-
3. Use the workflow to generate new images from your input images and prompts.
|
62 |
|
63 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
64 |
|
65 |
-
|
|
|
|
|
|
|
|
|
66 |
|
67 |
-
|
68 |
-
- Download the HiDream-I1 Full GGUF model from: [https://huggingface.co/city96/HiDream-I1-Full-gguf/tree/main](https://huggingface.co/city96/HiDream-I1-Full-gguf/tree/main)
|
69 |
-
- Follow the instructions in the custom node repository to set up and use the GGUF model with the Unet LOADER node.
|
70 |
|
71 |
-
|
72 |
-
|
73 |
-
|
74 |
|
75 |
-
## Acknowledgements
|
76 |
-
|
77 |
-
- **
|
|
|
78 |
- **MiaoshouAI** for the Florence-2 prompt generator implementation
|
79 |
-
- **ComfyUI** team for the
|
|
|
|
|
|
|
80 |
|
81 |
-
|
|
|
|
|
|
6 |
pipeline_tag: image-to-image
|
7 |
---
|
8 |
|
9 |
+
<div align="center">
|
10 |
|
11 |
+
# ๐ HiDream Img2Img ComfyUI Workflow
|
12 |
|
13 |
+
[](https://opensource.org/licenses/MIT)
|
14 |
+
[](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI)
|
15 |
+
[](https://replicate.com/goodguy1963/hidream-l1-full-img2img)
|
16 |
|
17 |
+
#### Advanced image-to-image generation with HiDream model suite and Florence-2 prompt generator
|
18 |
+
</div>
|
19 |
|
20 |
+
## ๐ Overview
|
|
|
|
|
|
|
|
|
|
|
21 |
|
22 |
+
This workflow combines the power of HiDream diffusion models with Florence-2 captioning for enhanced image-to-image generation in ComfyUI:
|
23 |
|
24 |
+
- โจ **Image-to-image generation** with the state-of-the-art HiDream diffusion model
|
25 |
+
- ๐ฎ **Florence-2** intelligent prompt generation and image captioning
|
26 |
+
- ๐ผ๏ธ **VAE encoding/decoding** and advanced CLIP-based text encoding
|
27 |
+
- ๐ซ **Negative prompt** support for artifact reduction
|
28 |
+
- ๐ป **Low VRAM mode** available for systems with limited resources
|
29 |
|
30 |
+
## ๐ Try It Now!
|
31 |
|
32 |
+
You can test this workflow directly on Replicate:
|
33 |
+
[โถ๏ธ Run on Replicate](https://replicate.com/goodguy1963/hidream-l1-full-img2img)
|
|
|
34 |
|
35 |
+
## ๐ฅ Required Models & Setup
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
36 |
|
37 |
+
### ๐จ Diffusion Model
|
38 |
+
|
39 |
+
- **`hidream_i1_full_fp16.safetensors`**
|
40 |
+
๐ Place in: `ComfyUI/models/diffusion_models`
|
41 |
+
๐ฆ [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/diffusion_models/hidream_i1_full_fp16.safetensors)
|
42 |
+
|
43 |
+
> **Credit:** [HiDream.ai](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI)
|
44 |
+
|
45 |
+
### ๐ Text Encoders
|
46 |
+
|
47 |
+
๐ Place all in: `ComfyUI/models/text_encoders`
|
48 |
+
|
49 |
+
- **`clip_g_hidream.safetensors`**
|
50 |
+
๐ฆ [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/text_encoders/clip_g_hidream.safetensors)
|
51 |
+
|
52 |
+
- **`clip_l_hidream.safetensors`**
|
53 |
+
๐ฆ [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/text_encoders/clip_l_hidream.safetensors)
|
54 |
+
|
55 |
+
- **`llama_3.1_8b_instruct_fp8_scaled.safetensors`**
|
56 |
+
๐ฆ [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/text_encoders/llama_3.1_8b_instruct_fp8_scaled.safetensors)
|
57 |
+
|
58 |
+
- **`t5xxl_fp8_e4m3fn_scaled.safetensors`**
|
59 |
+
๐ฆ [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/text_encoders/t5xxl_fp8_e4m3fn_scaled.safetensors)
|
60 |
+
|
61 |
+
### ๐ผ๏ธ VAE
|
62 |
+
|
63 |
+
- **`ae.safetensors`**
|
64 |
+
๐ Place in: `ComfyUI/models/vae`
|
65 |
+
๐ฆ [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/vae/ae.safetensors)
|
66 |
+
|
67 |
+
### ๐ Florence-2 Prompt Generator
|
68 |
|
|
|
69 |
- **Florence-2-large**
|
70 |
+
โก Automatic download at runtime
|
71 |
+
๐ฆ [Microsoft Florence-2](https://huggingface.co/microsoft/Florence-2-large)
|
72 |
+
|
73 |
+
> **Credit:** [MiaoshouAI](https://huggingface.co/MiaoshouAI/Florence-2-large-PromptGen-v2.0) for the optimized implementation
|
74 |
|
75 |
+
## ๐ก Usage Guide
|
|
|
|
|
|
|
76 |
|
77 |
+
1. Download all required models and place them in the correct directories as listed above
|
78 |
+
2. Import the workflow into ComfyUI
|
79 |
+
3. Load your input image, adjust settings as needed
|
80 |
+
4. Generate new images with enhanced quality
|
81 |
+
|
82 |
+
## ๐ป Low VRAM Mode (< 24GB VRAM)
|
83 |
+
|
84 |
+
<div align="center">
|
85 |
+
<img src="https://img.shields.io/badge/Memory-Efficient-brightgreen" alt="Memory Efficient"/>
|
86 |
+
</div>
|
87 |
+
|
88 |
+
For systems with limited VRAM, use this alternative setup:
|
89 |
|
90 |
+
1. Install [city96/ComfyUI-GGUF](https://github.com/city96/ComfyUI-GGUF) custom node
|
91 |
+
2. Replace the standard Diffusion Model Loader with the **Unet LOADER** node
|
92 |
+
3. Download the optimized HiDream-I1 Full GGUF model:
|
93 |
+
- ๐ฆ [HiDream-I1-Full-gguf](https://huggingface.co/city96/HiDream-I1-Full-gguf/tree/main)
|
94 |
+
- ๐ Place in: `ComfyUI/models/unet`
|
95 |
|
96 |
+
## ๐ Workflow Diagram
|
|
|
|
|
97 |
|
98 |
+
<div align="center">
|
99 |
+
<img src="https://huggingface.co/hofixD/comfyui-hidream-l1-full-img2img/resolve/main/WORKFLOW-HIDREAM-IMG2IMG.png" alt="HiDream Workflow Diagram" width="85%"/>
|
100 |
+
</div>
|
101 |
|
102 |
+
## ๐ Acknowledgements
|
103 |
+
|
104 |
+
- **HiDream.ai** for the remarkable diffusion model and encoders
|
105 |
+
- **Microsoft** for the Florence-2 vision-language model
|
106 |
- **MiaoshouAI** for the Florence-2 prompt generator implementation
|
107 |
+
- **ComfyUI** team for the intuitive workflow engine
|
108 |
+
- **city96** for the GGUF optimization for low VRAM systems
|
109 |
+
|
110 |
+
---
|
111 |
|
112 |
+
<div align="center">
|
113 |
+
<p>โญ If you find this workflow useful, please consider starring the repository! โญ</p>
|
114 |
+
</div>
|