hofixD committed on
Commit 6963cdc · verified · 1 Parent(s): 5db4208

Update README.md

Files changed (1): README.md (+86 -53)
@@ -6,76 +6,109 @@ base_model:
  pipeline_tag: image-to-image
  ---

- # HiDream Img2Img ComfyUI Workflow

- This workflow enables advanced image-to-image generation using the HiDream model suite and Florence-2 prompt generator, designed for use with ComfyUI and Replicate.

- You can test this workflow directly on Replicate: [https://replicate.com/goodguy1963/hidream-l1-full-img2img](https://replicate.com/goodguy1963/hidream-l1-full-img2img)

- ## Overview
- - **Image-to-image generation** with HiDream diffusion model
- - **Florence-2** for prompt generation and captioning
- - **VAE encoding/decoding** and advanced CLIP-based text encoding
- - **Negative prompt** support for artifact reduction
- - **LOW VRAM MODE**

- ## Required Models & Credits

- ### Diffusion Model
- - **hidream_i1_full_fp16.safetensors**
- Place in: `ComfyUI/models/diffusion_models`
- [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/diffusion_models/hidream_i1_full_fp16.safetensors)
- **Thanks to [HiDream.ai](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI) for the model!**

- ## For low VRAM user - GPU with less than 24GB VRAM:

- - You can replace the standard Diffusion Model Loader with the custom node **Unet LOADER** from [city96/ComfyUI-GGUF](https://github.com/city96/ComfyUI-GGUF) by city96.
- - Download the HiDream-I1 Full GGUF model from: [https://huggingface.co/city96/HiDream-I1-Full-gguf/tree/main](https://huggingface.co/city96/HiDream-I1-Full-gguf/tree/main)
- Place all in: ComfyUI/models/unet

- ### Text Encoders
- Place all in: `ComfyUI/models/text_encoders`
- - **clip_g_hidream.safetensors**
- [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/text_encoders/clip_g_hidream.safetensors)
- - **clip_l_hidream.safetensors**
- [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/text_encoders/clip_l_hidream.safetensors)
- - **llama_3.1_8b_instruct_fp8_scaled.safetensors**
- [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/text_encoders/llama_3.1_8b_instruct_fp8_scaled.safetensors)
- - **t5xxl_fp8_e4m3fn_scaled.safetensors**
- [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/text_encoders/t5xxl_fp8_e4m3fn_scaled.safetensors)

- ### VAE
- - **ae.safetensors**
- Place in: `ComfyUI/models/vae`
- [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/vae/ae.safetensors)

- ### Florence-2 Prompt Generator (NO need to download - will be downloaded automatacally at runtime)
  - **Florence-2-large**
- [Microsoft Florence-2](https://huggingface.co/microsoft/Florence-2-large)
- **Thanks to [MiaoshouAI](https://huggingface.co/MiaoshouAI/Florence-2-large-PromptGen-v2.0) for the correct implementation!**

- ## Usage
- 1. Download all required models and place them in the correct directories as listed above.
- 2. Drag the workflow image in ComfyUI
- 3. Use the workflow to generate new images from your input images and prompts.

- ---

- ## For low VRAM user - GPU with less than 24GB VRAM:

- - You can replace the standard Diffusion Model Loader with the custom node **Unet LOADER** from [city96/ComfyUI-GGUF](https://github.com/city96/ComfyUI-GGUF) by city96.
- - Download the HiDream-I1 Full GGUF model from: [https://huggingface.co/city96/HiDream-I1-Full-gguf/tree/main](https://huggingface.co/city96/HiDream-I1-Full-gguf/tree/main)
- - Follow the instructions in the custom node repository to set up and use the GGUF model with the Unet LOADER node.

- ## Workflow Diagram
- See the full workflow structure here:
- [WORKFLOW-HIDREAM-IMG2IMG.png](https://huggingface.co/hofixD/comfyui-hidream-l1-full-img2img/blob/main/WORKFLOW-HIDREAM-IMG2IMG.png)

- ## Acknowledgements
- - **HiDream.ai** for the diffusion model and encoders
- - **Microsoft** for Florence-2
  - **MiaoshouAI** for the Florence-2 prompt generator implementation
- - **ComfyUI** team for the UI and workflow engine

- Thank you to all model creators and contributors!

  pipeline_tag: image-to-image
  ---

+ <div align="center">

+ # 🌟 HiDream Img2Img ComfyUI Workflow

+ [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
+ [![Hugging Face](https://img.shields.io/badge/🤗%20Hugging%20Face-Models-blue)](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI)
+ [![Replicate](https://img.shields.io/badge/Replicate-Demo-brightgreen)](https://replicate.com/goodguy1963/hidream-l1-full-img2img)

+ #### Advanced image-to-image generation with HiDream model suite and Florence-2 prompt generator
+ </div>

+ ## 📋 Overview

+ This workflow combines the power of HiDream diffusion models with Florence-2 captioning for enhanced image-to-image generation in ComfyUI:

+ - ✨ **Image-to-image generation** with the state-of-the-art HiDream diffusion model
+ - 🔮 **Florence-2** intelligent prompt generation and image captioning
+ - 🖼️ **VAE encoding/decoding** and advanced CLIP-based text encoding
+ - 🚫 **Negative prompt** support for artifact reduction
+ - 💻 **Low VRAM mode** available for systems with limited resources

+ ## 🚀 Try It Now!

+ You can test this workflow directly on Replicate:
+ [▶️ Run on Replicate](https://replicate.com/goodguy1963/hidream-l1-full-img2img)

+ ## 📥 Required Models & Setup

+ ### 🎨 Diffusion Model
+
+ - **`hidream_i1_full_fp16.safetensors`**
+ 📁 Place in: `ComfyUI/models/diffusion_models`
+ 📦 [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/diffusion_models/hidream_i1_full_fp16.safetensors)
+
+ > **Credit:** [HiDream.ai](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI)
+
+ ### 📝 Text Encoders
+
+ 📁 Place all in: `ComfyUI/models/text_encoders`
+
+ - **`clip_g_hidream.safetensors`**
+ 📦 [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/text_encoders/clip_g_hidream.safetensors)
+
+ - **`clip_l_hidream.safetensors`**
+ 📦 [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/text_encoders/clip_l_hidream.safetensors)
+
+ - **`llama_3.1_8b_instruct_fp8_scaled.safetensors`**
+ 📦 [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/text_encoders/llama_3.1_8b_instruct_fp8_scaled.safetensors)
+
+ - **`t5xxl_fp8_e4m3fn_scaled.safetensors`**
+ 📦 [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/text_encoders/t5xxl_fp8_e4m3fn_scaled.safetensors)
+
+ ### 🖼️ VAE
+
+ - **`ae.safetensors`**
+ 📁 Place in: `ComfyUI/models/vae`
+ 📦 [Download](https://huggingface.co/Comfy-Org/HiDream-I1_ComfyUI/blob/main/split_files/vae/ae.safetensors)
+
+ ### 🔍 Florence-2 Prompt Generator

  - **Florence-2-large**
+ ⚡ Automatic download at runtime
+ 📦 [Microsoft Florence-2](https://huggingface.co/microsoft/Florence-2-large)
+
+ > **Credit:** [MiaoshouAI](https://huggingface.co/MiaoshouAI/Florence-2-large-PromptGen-v2.0) for the optimized implementation

+ ## 💡 Usage Guide

+ 1. Download all required models and place them in the correct directories as listed above
+ 2. Import the workflow into ComfyUI
+ 3. Load your input image, adjust settings as needed
+ 4. Generate new images with enhanced quality
+
+ ## 💻 Low VRAM Mode (< 24GB VRAM)
+
+ <div align="center">
+ <img src="https://img.shields.io/badge/Memory-Efficient-brightgreen" alt="Memory Efficient"/>
+ </div>
+
+ For systems with limited VRAM, use this alternative setup:

+ 1. Install [city96/ComfyUI-GGUF](https://github.com/city96/ComfyUI-GGUF) custom node
+ 2. Replace the standard Diffusion Model Loader with the **Unet LOADER** node
+ 3. Download the optimized HiDream-I1 Full GGUF model:
+ - 📦 [HiDream-I1-Full-gguf](https://huggingface.co/city96/HiDream-I1-Full-gguf/tree/main)
+ - 📁 Place in: `ComfyUI/models/unet`

+ ## 📊 Workflow Diagram

+ <div align="center">
+ <img src="https://huggingface.co/hofixD/comfyui-hidream-l1-full-img2img/resolve/main/WORKFLOW-HIDREAM-IMG2IMG.png" alt="HiDream Workflow Diagram" width="85%"/>
+ </div>

+ ## 🙏 Acknowledgements
+
+ - **HiDream.ai** for the remarkable diffusion model and encoders
+ - **Microsoft** for the Florence-2 vision-language model
  - **MiaoshouAI** for the Florence-2 prompt generator implementation
+ - **ComfyUI** team for the intuitive workflow engine
+ - **city96** for the GGUF optimization for low VRAM systems
+
+ ---

+ <div align="center">
+ <p>⭐ If you find this workflow useful, please consider starring the repository! ⭐</p>
+ </div>
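
Reviewer note (not part of the commit): the setup steps in the updated README can be checked with a short script. The sketch below is only an illustration. It assumes the file-to-directory mapping from the "Required Models & Setup" section and a ComfyUI checkout at a path you supply. The names `REQUIRED_MODELS`, `missing_models` and `to_direct_download_url` are hypothetical helpers, not part of ComfyUI or any Hugging Face library. The URL helper relies on the Hugging Face convention that `/blob/` links open the web viewer while `/resolve/` links serve the file itself, which is the same form the new README uses for the workflow image.

```python
from pathlib import Path

# File-to-directory mapping copied from the README's model list.
# Keys are sub-directories of ComfyUI/models.
REQUIRED_MODELS = {
    "diffusion_models": ["hidream_i1_full_fp16.safetensors"],
    "text_encoders": [
        "clip_g_hidream.safetensors",
        "clip_l_hidream.safetensors",
        "llama_3.1_8b_instruct_fp8_scaled.safetensors",
        "t5xxl_fp8_e4m3fn_scaled.safetensors",
    ],
    "vae": ["ae.safetensors"],
}


def missing_models(comfyui_root):
    """Return 'subdir/filename' for every required file not found on disk."""
    models_dir = Path(comfyui_root) / "models"
    missing = []
    for subdir, filenames in REQUIRED_MODELS.items():
        for name in filenames:
            if not (models_dir / subdir / name).is_file():
                missing.append(f"{subdir}/{name}")
    return missing


def to_direct_download_url(blob_url):
    """Rewrite a Hugging Face web-view link (/blob/) to a direct file link (/resolve/)."""
    return blob_url.replace("/blob/", "/resolve/", 1)


if __name__ == "__main__":
    # "ComfyUI" is an assumed install location; adjust to your setup.
    for path in missing_models("ComfyUI"):
        print("missing:", path)
```

Running the script from the directory that contains `ComfyUI` lists whatever still needs to be downloaded. The Florence-2 model is left out on purpose, since the README says it is downloaded automatically at runtime, and the GGUF file for low VRAM mode is omitted because the README does not name a specific file.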