title: HiDream Ai Fast
emoji: 🏢
colorFrom: pink
colorTo: purple
sdk: gradio
sdk_version: 5.23.3
python_version: 3.1
app_file: app-fast.py
pinned: false
short_description: 'Unofficial HiDream-ai Spaces '
HiDream-I1
HiDream-I1 is a new open-source image generative foundation model with 17B parameters that achieves state-of-the-art image generation quality within seconds.
Project Updates
2025/4/7: We've open-sourced the text-to-image model HiDream-I1.
Models
We offer both the full version and distilled models. For more information about the models, please refer to the link under Usage.
Name | Script | Inference Steps | HuggingFace repo |
---|---|---|---|
HiDream-I1-Full | inference.py | 50 | 🤗 HiDream-I1-Full |
HiDream-I1-Dev | inference.py | 28 | 🤗 HiDream-I1-Dev |
HiDream-I1-Fast | inference.py | 16 | 🤗 HiDream-I1-Fast |
Quick Start
Please make sure you have installed Flash Attention. We recommend CUDA version 12.4 if you install it manually.
pip install -r requirements.txt
Then you can run the inference scripts to generate images:
# For full model inference
python ./inference.py --model_type full
# For distilled dev model inference
python ./inference.py --model_type dev
# For distilled fast model inference
python ./inference.py --model_type fast
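The three commands above differ only in the `--model_type` value. As a minimal sketch, a small Python wrapper could build the command for a given variant. The names `build_command`, `run_inference`, and `INFERENCE_STEPS` are illustrative and not part of the repository; the step counts are copied from the Models table for reference only and are not passed to the script.

```python
import subprocess
import sys

# Step counts copied from the Models table; shown for reference only.
INFERENCE_STEPS = {"full": 50, "dev": 28, "fast": 16}


def build_command(model_type: str) -> list[str]:
    """Return the argv list that runs ./inference.py for the given model type."""
    if model_type not in INFERENCE_STEPS:
        raise ValueError(
            f"unknown model_type {model_type!r}; "
            f"expected one of {sorted(INFERENCE_STEPS)}"
        )
    return [sys.executable, "./inference.py", "--model_type", model_type]


def run_inference(model_type: str) -> None:
    """Run the inference script and raise if it exits with a non-zero status."""
    subprocess.run(build_command(model_type), check=True)
```

Calling `run_inference("fast")` from the repository root would then be equivalent to the last command above, with an early error for a mistyped model type.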
Note: The inference script will automatically download the meta-llama/Meta-Llama-3.1-8B-Instruct model files. If you encounter network issues, you can download these files ahead of time and place them in the appropriate cache directory to avoid download failures during inference.
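One possible way to fetch the files ahead of time is `snapshot_download` from the `huggingface_hub` package. This is a sketch under assumptions: the helper name `prefetch_llama` is hypothetical, the token is read from an `HF_TOKEN` environment variable if one is set, and it assumes the inference script loads the model from the default Hugging Face cache. The Llama repository may require accepting its license on the Hub before the download succeeds.

```python
import os
from typing import Optional

LLAMA_REPO_ID = "meta-llama/Meta-Llama-3.1-8B-Instruct"


def prefetch_llama(cache_dir: Optional[str] = None) -> str:
    """Download the Llama model files and return the local snapshot path.

    With cache_dir=None the default Hugging Face cache is used.
    """
    # Imported lazily so this module loads even without huggingface_hub.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=LLAMA_REPO_ID,
        cache_dir=cache_dir,
        token=os.environ.get("HF_TOKEN"),  # None falls back to any saved login
    )
```

Running `prefetch_llama()` once on a machine with a stable connection should populate the cache, so later inference runs do not need to download the model again.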
Gradio Demo
We also provide a Gradio demo for interactive image generation. You can run the demo with:
python gradio_demo.py
Evaluation Metrics
DPG-Bench
Model | Overall | Global | Entity | Attribute | Relation | Other |
---|---|---|---|---|---|---|
PixArt-alpha | 71.11 | 74.97 | 79.32 | 78.60 | 82.57 | 76.96 |
SDXL | 74.65 | 83.27 | 82.43 | 80.91 | 86.76 | 80.41 |
DALL-E 3 | 83.50 | 90.97 | 89.61 | 88.39 | 90.58 | 89.83 |
Flux.1-dev | 83.79 | 85.80 | 86.79 | 89.98 | 90.04 | 89.90 |
SD3-Medium | 84.08 | 87.90 | 91.01 | 88.83 | 80.70 | 88.68 |
Janus-Pro-7B | 84.19 | 86.90 | 88.90 | 89.40 | 89.32 | 89.48 |
CogView4-6B | 85.13 | 83.85 | 90.35 | 91.17 | 91.14 | 87.29 |
HiDream-I1 | 85.89 | 76.44 | 90.22 | 89.48 | 93.74 | 91.83 |
GenEval
Model | Overall | Single Obj. | Two Obj. | Counting | Colors | Position | Color attribution |
---|---|---|---|---|---|---|---|
SDXL | 0.55 | 0.98 | 0.74 | 0.39 | 0.85 | 0.15 | 0.23 |
PixArt-alpha | 0.48 | 0.98 | 0.50 | 0.44 | 0.80 | 0.08 | 0.07 |
Flux.1-dev | 0.66 | 0.98 | 0.79 | 0.73 | 0.77 | 0.22 | 0.45 |
DALL-E 3 | 0.67 | 0.96 | 0.87 | 0.47 | 0.83 | 0.43 | 0.45 |
CogView4-6B | 0.73 | 0.99 | 0.86 | 0.66 | 0.79 | 0.48 | 0.58 |
SD3-Medium | 0.74 | 0.99 | 0.94 | 0.72 | 0.89 | 0.33 | 0.60 |
Janus-Pro-7B | 0.80 | 0.99 | 0.89 | 0.59 | 0.90 | 0.79 | 0.66 |
HiDream-I1 | 0.83 | 1.00 | 0.98 | 0.79 | 0.91 | 0.60 | 0.72 |
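To read the GenEval table column by column, the short sketch below finds the top-scoring model in each category. The scores are copied from the table above; the `leaders` helper is illustrative and not part of the repository.

```python
COLUMNS = [
    "Overall", "Single Obj.", "Two Obj.", "Counting",
    "Colors", "Position", "Color attribution",
]

# Scores copied from the GenEval table, in COLUMNS order.
GENEVAL = {
    "SDXL":         [0.55, 0.98, 0.74, 0.39, 0.85, 0.15, 0.23],
    "PixArt-alpha": [0.48, 0.98, 0.50, 0.44, 0.80, 0.08, 0.07],
    "Flux.1-dev":   [0.66, 0.98, 0.79, 0.73, 0.77, 0.22, 0.45],
    "DALL-E 3":     [0.67, 0.96, 0.87, 0.47, 0.83, 0.43, 0.45],
    "CogView4-6B":  [0.73, 0.99, 0.86, 0.66, 0.79, 0.48, 0.58],
    "SD3-Medium":   [0.74, 0.99, 0.94, 0.72, 0.89, 0.33, 0.60],
    "Janus-Pro-7B": [0.80, 0.99, 0.89, 0.59, 0.90, 0.79, 0.66],
    "HiDream-I1":   [0.83, 1.00, 0.98, 0.79, 0.91, 0.60, 0.72],
}


def leaders(scores: dict, columns: list) -> dict:
    """Map each column name to the model with the highest score in it."""
    return {
        col: max(scores, key=lambda model: scores[model][i])
        for i, col in enumerate(columns)
    }


for column, model in leaders(GENEVAL, COLUMNS).items():
    print(f"{column}: {model}")
```

On these numbers HiDream-I1 leads every column except Position, where Janus-Pro-7B scores highest.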
HPSv2.1 benchmark
Model | Averaged | Animation | Concept-art | Painting | Photo |
---|---|---|---|---|---|
Stable Diffusion v2.0 | 26.38 | 27.09 | 26.02 | 25.68 | 26.73 |
Midjourney V6 | 30.29 | 32.02 | 30.29 | 29.74 | 29.10 |
SDXL | 30.64 | 32.84 | 31.36 | 30.86 | 27.48 |
DALL-E 3 | 31.44 | 32.39 | 31.09 | 31.18 | 31.09 |
SD3 | 31.53 | 32.60 | 31.82 | 32.06 | 29.62 |
Midjourney V5 | 32.33 | 34.05 | 32.47 | 32.24 | 30.56 |
CogView4-6B | 32.31 | 33.23 | 32.60 | 32.89 | 30.52 |
Flux.1-dev | 32.47 | 33.87 | 32.27 | 32.62 | 31.11 |
Stable Cascade | 32.95 | 34.58 | 33.13 | 33.29 | 30.78 |
HiDream-I1 | 33.82 | 35.05 | 33.74 | 33.88 | 32.61 |
License
The code in this repository and the HiDream-I1 models are licensed under the MIT License.