---
base_model:
  - black-forest-labs/FLUX.1-dev
pipeline_tag: text-to-image
tags:
  - gguf
  - flux
  - text-to-image
  - imatrix
---

Support

Bravo

Combined imatrix: multiple images, 25 and 50 steps, city96/flux1-dev-Q8_0, euler

Experimental from q8/f16?

| Filename | Quant type | File Size | Description | Example Image |
| -------- | ---------- | --------- | ----------- | ------------- |
| - | IQ1_S | TBC | TBC | - |

Alpha

Simple imatrix: 512x512, single image, 8/20 steps, city96/flux1-dev-Q3_K_S, euler

data: load_imatrix: loaded 314 importance matrix entries from imatrix.dat computed on 7 chunks.

Using llama.cpp quantize cae9fb4 with modified lcpp.patch.

Experimental from q8

| Filename | Quant type | File Size | Description | Example Image |
| -------- | ---------- | --------- | ----------- | ------------- |
| flux1-dev-IQ1_S.gguf | IQ1_S | 2.45GB | obviously bad quality | Example |
| - | IQ1_M | - | broken | - |
| flux1-dev-TQ1_0.gguf | TQ1_0 | 2.63GB | TBC | - |
| flux1-dev-TQ2_0.gguf | TQ2_0 | 3.19GB | TBC | - |
| flux1-dev-IQ2_XXS.gguf | IQ2_XXS | 3.19GB | TBC | Example |
| flux1-dev-IQ2_XS.gguf | IQ2_XS | 3.56GB | TBC | - |
| flux1-dev-IQ2_S.gguf | IQ2_S | 3.56GB | TBC | - |
| flux1-dev-IQ2_M.gguf | IQ2_M | 3.93GB | TBC | - |
| flux1-dev-Q2_K.gguf | Q2_K | 4.02GB | TBC | - |
| flux1-dev-Q2_K_S.gguf | Q2_K_S | 4.02GB | TBC | - |
| flux1-dev-IQ3_XXS.gguf | IQ3_XXS | 4.66GB | TBC | - |
| flux1-dev-IQ3_XS.gguf | IQ3_XS | 5.22GB | TBC | - |
| flux1-dev-IQ3_S.gguf | IQ3_S | 5.22GB | TBC | - |
| flux1-dev-IQ3_M.gguf | IQ3_M | 5.22GB | TBC | - |
| flux1-dev-Q3_K_S.gguf | Q3_K_S | 5.22GB | TBC | - |
| flux1-dev-Q3_K_M.gguf | Q3_K_M | 5.36GB | TBC | - |
| flux1-dev-Q3_K_L.gguf | Q3_K_L | 5.36GB | TBC | - |
| flux1-dev-IQ4_XS.gguf | IQ4_XS | 6.42GB | TBC | - |
| flux1-dev-IQ4_NL.gguf | IQ4_NL | 6.79GB | TBC | - |
| flux1-dev-Q4_0.gguf | Q4_0 | 6.79GB | TBC | - |
| - | Q4_K | TBC | TBC | - |
| flux1-dev-Q4_K_S.gguf | Q4_K_S | 6.79GB | TBC | - |
| flux1-dev-Q4_K_M.gguf | Q4_K_M | 6.93GB | TBC | - |
| flux1-dev-Q4_1.gguf | Q4_1 | 7.53GB | TBC | - |
| - | Q5_K | TBC | TBC | - |
| - | Q5_K_S | TBC | TBC | - |
| - | Q5_K_M | TBC | TBC | - |
| - | Q6_K | TBC | TBC | - |
| - | Q8_0 | 12.7GB | TBC | Example |
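
As a rough sanity check on the sizes above, the effective bits per weight of each file can be estimated relative to the Q8_0 file. This is only a sketch under two assumptions that are not stated in this README: that Q8_0 stores each block of 32 weights in 34 bytes (8.5 bits per weight), and that file size scales linearly with average bits per weight. It ignores metadata and any tensors kept at higher precision, so treat the numbers as approximate. The helper name `effective_bpw` is hypothetical.

```python
# Assumption: Q8_0 packs 32 int8 weights plus one fp16 scale = 34 bytes per block.
Q8_0_BPW = 34 * 8 / 32  # 8.5 bits per weight
Q8_0_SIZE_GB = 12.7     # Q8_0 file size from the table above


def effective_bpw(size_gb: float) -> float:
    """Estimate average bits per weight by scaling against the Q8_0 file size."""
    return size_gb / Q8_0_SIZE_GB * Q8_0_BPW


# A few sizes (GB) copied from the table above.
for quant, size_gb in [("IQ1_S", 2.45), ("IQ2_XS", 3.56), ("IQ2_S", 3.56), ("Q4_0", 6.79)]:
    print(f"{quant:8s} {size_gb:5.2f} GB  ~{effective_bpw(size_gb):.2f} bpw")
```

Two quants that land on the same estimate, such as IQ2_XS and IQ2_S here, are candidates for the undifferentiated sub-quant issue.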

Observations

Sub-quants are not differentiated as expected (the file sizes are identical): IQ2_XS == IQ2_S, IQ3_XS == IQ3_S == IQ3_M, Q3_K_M == Q3_K_L.

  • Check if lcpp_sd3.patch includes more specific quant level logic
  • Extrapolate the existing level logic
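
The collisions noted above can be found mechanically by grouping the Alpha table entries by file size. The following is a minimal sketch, not part of the quantization workflow itself. The sizes are copied from the table, and the function name `same_size_groups` is hypothetical. Matching sizes at two decimal places only suggest that two files were quantized identically; comparing per-tensor types would be needed to confirm it.

```python
from collections import defaultdict

# File sizes in GB, copied from the Alpha table.
SIZES_GB = {
    "IQ1_S": 2.45, "TQ1_0": 2.63, "TQ2_0": 3.19, "IQ2_XXS": 3.19,
    "IQ2_XS": 3.56, "IQ2_S": 3.56, "IQ2_M": 3.93, "Q2_K": 4.02,
    "Q2_K_S": 4.02, "IQ3_XXS": 4.66, "IQ3_XS": 5.22, "IQ3_S": 5.22,
    "IQ3_M": 5.22, "Q3_K_S": 5.22, "Q3_K_M": 5.36, "Q3_K_L": 5.36,
    "IQ4_XS": 6.42, "IQ4_NL": 6.79, "Q4_0": 6.79, "Q4_K_S": 6.79,
    "Q4_K_M": 6.93, "Q4_1": 7.53, "Q8_0": 12.7,
}


def same_size_groups(sizes: dict[str, float]) -> dict[float, list[str]]:
    """Return {size: [quant, ...]} for every size shared by more than one quant."""
    groups: dict[float, list[str]] = defaultdict(list)
    for quant, size in sizes.items():
        groups[size].append(quant)
    return {size: quants for size, quants in sorted(groups.items()) if len(quants) > 1}


for size, quants in same_size_groups(SIZES_GB).items():
    print(f"{size:.2f} GB: {' == '.join(quants)}")
```

Note that a shared size across different quant families (for example a TQ and an IQ type) may be coincidental rather than a sign of missing level logic.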