<!--
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
specific language governing permissions and limitations under the License.
-->

<p align="center">
    <br>
    <img src="https://raw.githubusercontent.com/huggingface/diffusers/77aadfee6a891ab9fcfb780f87c693f7a5beeb8e/docs/source/imgs/diffusers_library.jpg" width="400"/>
    <br>
</p>

🤗 Diffusers provides pretrained vision and audio diffusion models, and serves as a modular toolbox for inference and training.

More precisely, 🤗 Diffusers offers:

- State-of-the-art diffusion pipelines that can run inference in just a few lines of code (see [**Using Diffusers**](./using-diffusers/conditional_image_generation)). For an overview of all supported pipelines and their corresponding papers, see the **Pipelines** table below.
- Various noise schedulers that can be used interchangeably to trade off speed against quality during inference. For details, see [**Schedulers**](./api/schedulers/overview).
- Multiple types of models, such as UNet, that can be used as building blocks in an end-to-end diffusion system. For details, see [**Models**](./api/models).
- Examples that show how to train models for the most popular diffusion tasks. For details, see [**Training**](./training/overview).
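
The first two points above, running a pipeline in a few lines and swapping schedulers, can be sketched in Python as follows. This is a minimal sketch, not an official recipe: the checkpoint ID `google/ddpm-cat-256` and the helper names `load_pipeline` and `swap_scheduler` are assumptions that do not appear in this page. The `diffusers` import is kept inside the function so that defining the helpers does not require the library or a download.

```python
def load_pipeline(model_id="google/ddpm-cat-256"):
    """Load a pretrained pipeline from the Hub (downloads weights on first use).

    The default checkpoint ID is an assumption chosen for illustration.
    """
    from diffusers import DiffusionPipeline  # imported lazily on purpose

    return DiffusionPipeline.from_pretrained(model_id)


def swap_scheduler(pipeline, scheduler_cls):
    """Replace the pipeline's scheduler, reusing the existing scheduler config.

    Hypothetical helper: it only wraps the `from_config` pattern and assumes
    `scheduler_cls` is compatible with the pipeline's current scheduler.
    """
    pipeline.scheduler = scheduler_cls.from_config(pipeline.scheduler.config)
    return pipeline


# Usage (left commented out because it downloads model weights):
# from diffusers import DDIMScheduler
# pipeline = load_pipeline()
# pipeline = swap_scheduler(pipeline, DDIMScheduler)
# image = pipeline(num_inference_steps=50).images[0]
# image.save("sample.png")
```

Building the new scheduler from the old scheduler's config keeps settings such as the number of training timesteps consistent, so only the sampling algorithm changes. Whether fewer inference steps still give acceptable quality depends on the checkpoint and scheduler, so treat the step count as something to tune.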

The following table summarizes all officially supported pipelines, their corresponding papers, and, where available, a Colab notebook for trying them out directly.

| Pipeline | Paper | Tasks | Colab |
| --- | --- | --- | --- |
| [alt_diffusion](./api/pipelines/alt_diffusion) | [**AltDiffusion**](https://arxiv.org/abs/2211.06679) | Image-to-Image Text-Guided Generation | |
| [audio_diffusion](./api/pipelines/audio_diffusion) | [**Audio Diffusion**](https://github.com/teticio/audio-diffusion.git) | Unconditional Audio Generation | [Open in Colab](https://colab.research.google.com/github/teticio/audio-diffusion/blob/master/notebooks/audio_diffusion_pipeline.ipynb) |
| [cycle_diffusion](./api/pipelines/cycle_diffusion) | [**Cycle Diffusion**](https://arxiv.org/abs/2210.05559) | Image-to-Image Text-Guided Generation | |
| [dance_diffusion](./api/pipelines/dance_diffusion) | [**Dance Diffusion**](https://github.com/williamberman/diffusers.git) | Unconditional Audio Generation | |
| [ddpm](./api/pipelines/ddpm) | [**Denoising Diffusion Probabilistic Models**](https://arxiv.org/abs/2006.11239) | Unconditional Image Generation | |
| [ddim](./api/pipelines/ddim) | [**Denoising Diffusion Implicit Models**](https://arxiv.org/abs/2010.02502) | Unconditional Image Generation | |
| [latent_diffusion](./api/pipelines/latent_diffusion) | [**High-Resolution Image Synthesis with Latent Diffusion Models**](https://arxiv.org/abs/2112.10752) | Text-to-Image Generation | |
| [latent_diffusion](./api/pipelines/latent_diffusion) | [**High-Resolution Image Synthesis with Latent Diffusion Models**](https://arxiv.org/abs/2112.10752) | Super Resolution Image-to-Image | |
| [latent_diffusion_uncond](./api/pipelines/latent_diffusion_uncond) | [**High-Resolution Image Synthesis with Latent Diffusion Models**](https://arxiv.org/abs/2112.10752) | Unconditional Image Generation | |
| [paint_by_example](./api/pipelines/paint_by_example) | [**Paint by Example: Exemplar-based Image Editing with Diffusion Models**](https://arxiv.org/abs/2211.13227) | Image-Guided Image Inpainting | |
| [pndm](./api/pipelines/pndm) | [**Pseudo Numerical Methods for Diffusion Models on Manifolds**](https://arxiv.org/abs/2202.09778) | Unconditional Image Generation | |
| [score_sde_ve](./api/pipelines/score_sde_ve) | [**Score-Based Generative Modeling through Stochastic Differential Equations**](https://openreview.net/forum?id=PxTIG12RRHS) | Unconditional Image Generation | |
| [score_sde_vp](./api/pipelines/score_sde_vp) | [**Score-Based Generative Modeling through Stochastic Differential Equations**](https://openreview.net/forum?id=PxTIG12RRHS) | Unconditional Image Generation | |
| [stable_diffusion](./api/pipelines/stable_diffusion/text2img) | [**Stable Diffusion**](https://stability.ai/blog/stable-diffusion-public-release) | Text-to-Image Generation | [Open in Colab](https://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/training_example.ipynb) |
| [stable_diffusion](./api/pipelines/stable_diffusion/img2img) | [**Stable Diffusion**](https://stability.ai/blog/stable-diffusion-public-release) | Image-to-Image Text-Guided Generation | [Open in Colab](https://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/image_2_image_using_diffusers.ipynb) |
| [stable_diffusion](./api/pipelines/stable_diffusion/inpaint) | [**Stable Diffusion**](https://stability.ai/blog/stable-diffusion-public-release) | Text-Guided Image Inpainting | [Open in Colab](https://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/in_painting_with_stable_diffusion_using_diffusers.ipynb) |
| [stable_diffusion_2](./api/pipelines/stable_diffusion_2) | [**Stable Diffusion 2**](https://stability.ai/blog/stable-diffusion-v2-release) | Text-to-Image Generation | |
| [stable_diffusion_2](./api/pipelines/stable_diffusion_2) | [**Stable Diffusion 2**](https://stability.ai/blog/stable-diffusion-v2-release) | Text-Guided Image Inpainting | |
| [stable_diffusion_2](./api/pipelines/stable_diffusion_2) | [**Stable Diffusion 2**](https://stability.ai/blog/stable-diffusion-v2-release) | Text-Guided Super Resolution Image-to-Image | |
| [stable_diffusion_safe](./api/pipelines/stable_diffusion_safe) | [**Safe Stable Diffusion**](https://arxiv.org/abs/2211.05105) | Text-Guided Generation | [Open in Colab](https://colab.research.google.com/github/ml-research/safe-latent-diffusion/blob/main/examples/Safe%20Latent%20Diffusion.ipynb) |
| [stochastic_karras_ve](./api/pipelines/stochastic_karras_ve) | [**Elucidating the Design Space of Diffusion-Based Generative Models**](https://arxiv.org/abs/2206.00364) | Unconditional Image Generation | |
| [unclip](./api/pipelines/unclip) | [Hierarchical Text-Conditional Image Generation with CLIP Latents](https://arxiv.org/abs/2204.06125) | Text-to-Image Generation | |
| [versatile_diffusion](./api/pipelines/versatile_diffusion) | [Versatile Diffusion: Text, Images and Variations All in One Diffusion Model](https://arxiv.org/abs/2211.08332) | Text-to-Image Generation | |
| [versatile_diffusion](./api/pipelines/versatile_diffusion) | [Versatile Diffusion: Text, Images and Variations All in One Diffusion Model](https://arxiv.org/abs/2211.08332) | Image Variations Generation | |
| [versatile_diffusion](./api/pipelines/versatile_diffusion) | [Versatile Diffusion: Text, Images and Variations All in One Diffusion Model](https://arxiv.org/abs/2211.08332) | Dual Image and Text Guided Generation | |
| [vq_diffusion](./api/pipelines/vq_diffusion) | [Vector Quantized Diffusion Model for Text-to-Image Synthesis](https://arxiv.org/abs/2111.14822) | Text-to-Image Generation | |

**Note**: Pipelines are simple examples of how to use a diffusion system as described in the corresponding paper.
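
To make the idea of a pipeline as a thin wrapper more concrete, the following sketch shows roughly what an unconditional image pipeline does internally: a model predicts the noise at each timestep and a scheduler uses that prediction to compute a slightly less noisy sample. The function names `load_components` and `denoise` and the checkpoint ID `google/ddpm-cat-256` are assumptions made for illustration, and real pipelines add pre- and post-processing that is omitted here.

```python
def load_components(model_id="google/ddpm-cat-256"):
    """Load a UNet and a matching scheduler (downloads weights on first use).

    The checkpoint ID is an assumption chosen for illustration.
    """
    from diffusers import DDPMScheduler, UNet2DModel  # imported lazily

    model = UNet2DModel.from_pretrained(model_id)
    scheduler = DDPMScheduler.from_pretrained(model_id)
    return model, scheduler


def denoise(model, scheduler, sample, num_inference_steps=50):
    """Run a basic denoising loop and return the final sample.

    `model(sample, t).sample` is the predicted noise, and
    `scheduler.step(...).prev_sample` is the sample for the next iteration.
    """
    scheduler.set_timesteps(num_inference_steps)
    for t in scheduler.timesteps:
        noise_pred = model(sample, t).sample
        sample = scheduler.step(noise_pred, t, sample).prev_sample
    return sample


# Usage (left commented out because it downloads model weights):
# import torch
# model, scheduler = load_components()
# size = model.config.sample_size
# noise = torch.randn((1, 3, size, size))
# with torch.no_grad():
#     image = denoise(model, scheduler, noise)
```

Because `denoise` only relies on the `set_timesteps`, `timesteps`, and `step` interface, a different scheduler can be passed in without changing the loop, which is the sense in which schedulers are interchangeable.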