File size: 29,845 Bytes
73daf53 e876f57 73daf53 fae1792 bfebeee 8ed7511 bfebeee 2e88dd6 bfebeee 61ffc1c 9f102be a7949c8 9f102be 83a38b0 61ffc1c 412e513 f921049 412e513 cfb3ff6 412e513 2573ba8 f921049 2573ba8 412e513 12c8a4a 61ffc1c e1cad8f ead1f6f e1cad8f efad7b5 ead1f6f e1cad8f efad7b5 e1cad8f efad7b5 e1cad8f efad7b5 a952f6a e1cad8f 5a39a0f e1cad8f 412e513 971d36e 9f102be 971d36e fb355db 7041fe5 fb355db 41e77fd aaf7427 61ffc1c 7b93d6d 919d468 7b93d6d 919d468 7b93d6d 919d468 a691a9b 919d468 7b93d6d 919d468 5e19c0a 919d468 5e19c0a 9f41a78 78649b0 73b1b2d b0f9180 69ca68a 9cb7abe 487e890 71cf042 bfebeee f6a58d6 948fb1f bfebeee 73daf53 61ffc1c 68ffa29 d0b2a90 a691a9b 68ffa29 16f49d8 68ffa29 16f49d8 68ffa29 16f49d8 68ffa29 16f49d8 68ffa29 16f49d8 68ffa29 16f49d8 aaf7427 16f49d8 61ffc1c 5e19c0a 948fb1f 3097ee2 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 |
---
base_model:
- black-forest-labs/FLUX.1-dev
pipeline_tag: text-to-image
library_name: gguf
license: other
license_name: flux-1-dev-non-commercial-license
tags:
- gguf
- flux
- text-to-image
- imatrix
---
# Supported?
Expect broken or faulty items for the time being. Use at your own discretion.
- ComfyUI-GGUF: all? (CPU/CUDA)
- Fast dequant: BF16, Q8_0, Q5_1, Q5_0, Q4_1, Q4_0, Q6_K, Q5_K, Q4_K, Q3_K, Q2_K
- Slow dequant: others [via GGUF/NumPy](https://github.com/city96/ComfyUI-GGUF/blob/379175e7bf8b65019cdd11108bb882120a6f17df/dequant.py#L24-L28)
- Forge: TBC
- stable-diffusion.cpp: [llama.cpp Feature-matrix](https://github.com/ggerganov/llama.cpp/wiki/Feature-matrix)
- CPU: all
- Cuda: all?
- Vulkan: >= Q3_K_S, > IQ4_S; [PR IQ1_S, IQ1_M](https://github.com/ggerganov/llama.cpp/pull/11528) [PR IQ4_XS](https://github.com/ggerganov/llama.cpp/pull/11501)
- other: ?
# Disco
Dynamic quantization:
- time_in.in_layer: Q8_0/Q6_K
- final_layer, vector_in.in_layer, guidance_in: Q8_0
- vector_in.out_layer, time_in.out_layer, txt_in, img_in: F16
- single_blocks.[> 10 && < 37].modulation.lin: one down?
| Filename | Quant type | File Size | Description / L2 Loss Step 25 | Example Image |
| -------- | ---------- | --------- | ----------------------------- | ------------- |
# Caesar
Combined imatrix multiple images 512x512 and 768x768, 25, 30 and 50 steps [city96/flux1-dev-Q8_0](https://huggingface.co/city96/FLUX.1-dev-gguf/blob/main/flux1-dev-Q8_0.gguf) euler
data: `load_imatrix: loaded 314 importance matrix entries from imatrix_caesar.dat computed on 475 chunks`
Using [llama.cpp quantize cae9fb4](https://github.com/ggerganov/llama.cpp/commit/cae9fb4361138b937464524eed907328731b81f6) with modified [lcpp.patch](https://github.com/city96/ComfyUI-GGUF/blob/main/tools/lcpp.patch).
Dynamic quantization:
- img_in, guidance_in.in_layer, final_layer.linear: f32/bf16/f16
- guidance_in, final_layer: bf16/f16
- img_attn.qkv, linear1: some layers two bits up
- txt_mod.lin, txt_mlp, txt_attn.proj: some layers one bit down
## Experimental from f16
| Filename | Quant type | File Size | Description / L2 Loss Step 25 | Example Image |
| -------- | ---------- | --------- | ----------------------------- | ------------- |
| [flux1-dev-IQ1_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ1_S.gguf) | IQ1_S | 2.41GB | worst / 173 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_IQ1_S_512_25_woman.png) |
| [flux1-dev-TQ1_0.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-TQ1_0.gguf) | TQ1_0 | 2.64GB | worst / 195 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_TQ1_0_512_25_woman.png) |
| [flux1-dev-IQ1_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ1_M.gguf) | IQ1_M | 2.72GB | worst / 171 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_IQ1_M_512_25_woman.png) |
| [flux1-dev-IQ2_XXS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ2_XXS.gguf) | IQ2_XXS | 3.10GB | worst * / 126 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_IQ2_XXS_512_25_woman.png) |
| [flux1-dev-TQ2_0.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-TQ2_0.gguf) | TQ2_0 | 3.12GB | worst / 202 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_TQ2_0_512_25_woman.png) |
| [flux1-dev-IQ2_XS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ2_XS.gguf) | IQ2_XS | 3.48GB | worst / 140 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_IQ2_XS_512_25_woman.png) |
| [flux1-dev-IQ2_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ2_S.gguf) | IQ2_S | 3.51GB | worst / 142 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_IQ2_S_512_25_woman.png) |
| [flux1-dev-IQ2_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ2_M.gguf) | IQ2_M | 3.84GB | bad / 120 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_IQ2_M_512_25_woman.png) |
| [flux1-dev-Q2_K_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q2_K_S.gguf) | Q2_K_S | 4.00GB | ok * / 52 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_Q2_K_S_512_25_woman.png) |
| [flux1-dev-Q2_K.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q2_K.gguf) | Q2_K | 4.03GB | ok / 55 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_Q2_K_512_25_woman.png) |
| [flux1-dev-IQ3_XXS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ3_XXS.gguf) | IQ3_XXS | 4.56GB | ok / 92 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_IQ3_XXS_512_25_woman.png) |
| [flux1-dev-IQ3_XS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ3_XS.gguf) | IQ3_XS | 5.05GB | bad / 125 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_IQ3_XS_512_25_woman.png) |
| [flux1-dev-Q3_K_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q3_K_S.gguf) | Q3_K_S | 5.10GB | ok / 48 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_Q3_K_S_512_25_woman.png) |
| [flux1-dev-IQ3_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ3_S.gguf) | IQ3_S | 5.11GB | bad / 123 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_IQ3_S_512_25_woman.png) |
| [flux1-dev-Q3_K_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q3_K_M.gguf) | Q3_K_M | 5.13GB | ok / 50 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_Q3_K_M_512_25_woman.png) |
| [flux1-dev-IQ3_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ3_M.gguf) | IQ3_M | 5.14GB | bad / 123 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_IQ3_M_512_25_woman.png) |
| [flux1-dev-Q3_K_L.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q3_K_L.gguf) | Q3_K_L | 5.17GB | ok / 61 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_Q3_K_L_512_25_woman.png) |
| [flux1-dev-IQ4_XS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ4_XS.gguf) | IQ4_XS | 6.33GB | good / 33 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_IQ4_XS_512_25_woman.png) |
| [flux1-dev-Q4_K_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q4_K_S.gguf) | Q4_K_S | 6.66GB | good / 22 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_Q4_K_S_512_25_woman.png) |
| [flux1-dev-Q4_K_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q4_K_M.gguf) | Q4_K_M | 6.69GB | good / 21 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_Q4_K_M_512_25_woman.png) |
| [flux1-dev-IQ4_NL.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ4_NL.gguf) | IQ4_NL | 6.69GB | good / 24 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_IQ4_NL_512_25_woman.png) |
| [flux1-dev-Q4_0.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q4_0.gguf) | Q4_0 | 6.81GB | good / 30 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_Q4_0_512_25_woman.png) |
| [flux1-dev-Q4_1.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q4_1.gguf) | Q4_1 | 7.55GB | good / 27 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_Q4_1_512_25_woman.png) |
| [flux1-dev-Q5_K_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q5_K_S.gguf) | Q5_K_S | 8.26GB | nice / 21 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_Q5_K_S_512_25_woman.png) |
| [flux1-dev-Q5_0.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q5_0.gguf) | Q5_0 | 8.27GB | good / 30 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_Q5_0_512_25_woman.png) |
| [flux1-dev-Q5_K_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q5_K_M.gguf) | Q5_K_M | 8.30GB | nice / 23 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_Q5_K_M_512_25_woman.png) |
| [flux1-dev-Q5_1.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q5_1.gguf) | Q5_1 | 8.99GB | nice * / 14 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_Q5_1_512_25_woman.png) |
| [flux1-dev-Q6_K.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q6_K.gguf) | Q6_K | 9.80GB | nice / 20 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_Q6_K_512_25_woman.png) |
| [flux1-dev-Q8_0.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q8_0.gguf) | Q8_0 | 12.3GB | near perfect * / 8 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/images/output_test_caesar_Q8_0_512_25_woman.png) |
| - | F16 | 23.8GB | reference | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/images/output_test_F16_512_25_woman.png) |
| Filename | Bits img_attn.qkv & linear1 |
| -------- | --------------------------- |
| [flux1-dev-IQ1_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ1_S.gguf) | 333M MMMM M111 ... 11MM MM11 |
| [flux1-dev-TQ1_0.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-TQ1_0.gguf) | 3332 2222 2111 ... 1122 2211 |
| [flux1-dev-IQ1_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ1_M.gguf) | 3332 2222 2111 ... 1122 2211 |
| [flux1-dev-IQ2_XXS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ2_XXS.gguf) | 4433 3333 3222 ... 2222 |
| [flux1-dev-TQ2_0.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-TQ2_0.gguf) | 3332 2222 2111 ... 1122 2211 |
| [flux1-dev-IQ2_XS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ2_XS.gguf) | 4443 3333 3222 ... 2233 3322 |
| [flux1-dev-IQ2_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ2_S.gguf) | 4444 4444 4444 4444 4433 3222 ... 2233 3322 |
| [flux1-dev-IQ2_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ2_M.gguf) | 4444 4444 4444 4444 4433 3222 ... 2223 3333 3322 |
| [flux1-dev-Q2_K_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q2_K_S.gguf) | 4443 3333 3222 ... 2222 |
| [flux1-dev-Q2_K.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q2_K.gguf) | 4443 3333 3222 ... 2233 3322 |
| [flux1-dev-IQ3_XXS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ3_XXS.gguf) | 444S SSSS S333 ... 3333 |
| [flux1-dev-IQ3_XS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ3_XS.gguf) | 444S SSSS S333 ... 33SS SS33 |
| [flux1-dev-Q3_K_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q3_K_S.gguf) | 5554 4444 4333 ... 3333 |
| [flux1-dev-IQ3_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ3_S.gguf) | 5554 4444 4333 ... 3344 4433 |
| [flux1-dev-Q3_K_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q3_K_M.gguf) | 5554 4444 4333 ... 3344 4433 |
| [flux1-dev-IQ3_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ3_M.gguf) | 5554 4444 4444 4444 4433 ... 3344 4433 |
| [flux1-dev-Q3_K_L.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q3_K_L.gguf) | 5554 4444 4444 4444 4433 ... 3344 4433 |
| [flux1-dev-IQ4_XS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ4_XS.gguf) | 8885 5555 5444 ... 4444 |
| [flux1-dev-Q4_K_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q4_K_S.gguf) | 8885 5555 5444 ... 4444 |
| [flux1-dev-Q4_K_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q4_K_M.gguf) | 8885 5555 5555 5555 5544 ... 4444 |
| [flux1-dev-IQ4_NL.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-IQ4_NL.gguf) | 8885 5555 5555 5555 5544 ... 4444 |
| [flux1-dev-Q4_0.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q4_0.gguf) | 8885 5555 5444 ... 4444 |
| [flux1-dev-Q4_1.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q4_1.gguf) | 8885 5555 5444 ... 4444 |
| [flux1-dev-Q5_K_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q5_K_S.gguf) | FFF6 6666 6666 6666 6655 ... 5555 |
| [flux1-dev-Q5_0.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q5_0.gguf) | FFF8 8888 8555 ... 5555 |
| [flux1-dev-Q5_K_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q5_K_M.gguf) | FFF8 8888 8666 6666 6655 ... 5555 |
| [flux1-dev-Q5_1.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q5_1.gguf) | FFF8 8888 8555 ... 5555 |
| [flux1-dev-Q6_K.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q6_K.gguf) | FFF8 8888 8666 .. 6666 |
| [flux1-dev-Q8_0.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-caesar/flux1-dev-Q8_0.gguf) | FFF8 8888 .. 8888 |
## Observations
- More imatrix data doesn't necessarily result in better quants
- I-quants worse than same bits k-quants?
- [Quant-dequant loss](https://huggingface.co/Eviation/flux-imatrix/blob/main/images/loss_quants.png)
# Bravo
Combined imatrix multiple images 512x512 25 and 50 steps [city96/flux1-dev-Q8_0](https://huggingface.co/city96/FLUX.1-dev-gguf/blob/main/flux1-dev-Q8_0.gguf) euler
Using [llama.cpp quantize cae9fb4](https://github.com/ggerganov/llama.cpp/commit/cae9fb4361138b937464524eed907328731b81f6) with modified [lcpp.patch](https://github.com/city96/ComfyUI-GGUF/blob/main/tools/lcpp.patch).
## Experimental from f16
| Filename | Quant type | File Size | Description / L2 Loss Step 25 | Example Image |
| -------- | ---------- | --------- | ----------------------------- | ------------- |
| [flux1-dev-IQ1_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-IQ1_S.gguf) | IQ1_S | 2.45GB | worst / 156 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/images/output_test_comb_IQ1_S_512_25_woman.png) |
| [flux1-dev-IQ1_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-IQ1_M.gguf) | IQ1_M | 2.72GB | worst / 141 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/images/output_test_comb_IQ1_M_512_25_woman.png) |
| [flux1-dev-IQ2_XXS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-IQ2_XXS.gguf) | IQ2_XXS | 3.19GB | worst / 131 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/images/output_test_comb_IQ2_XXS_512_25_woman.png) |
| [flux1-dev-IQ2_XS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-IQ2_XS.gguf) | IQ2_XS | 3.56GB | worst / 125 | - |
| [flux1-dev-IQ2_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-IQ2_S.gguf) | IQ2_S | 3.56GB | worst / 125 | - |
| [flux1-dev-IQ2_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-IQ2_M.gguf) | IQ2_M | 3.93GB | worst / 120 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/images/output_test_comb_IQ2_M_512_25_woman.png) |
| [flux1-dev-Q2_K_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-Q2_K_S.gguf) | Q2_K_S | 4.02GB | ok / 56 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/images/output_test_comb_Q2_K_S_512_25_woman.png) |
| [flux1-dev-IQ3_XXS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-IQ3_XXS.gguf) | IQ3_XXS | 4.66GB | TBC / 68 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/images/output_test_comb_IQ3_XXS_512_25_woman.png) |
| [flux1-dev-IQ3_XS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-IQ3_XS.gguf) | IQ3_XS | 5.22GB | worse than IQ3_XXS / 115 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/images/output_test_comb_IQ3_XS_512_25_woman.png) |
| flux1-dev-IQ3_S.gguf | IQ3_S | TBC | TBC | - |
| flux1-dev-IQ3_M.gguf | IQ3_M | TBC | TBC | - |
| [flux1-dev-Q3_K_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-Q3_K_S.gguf) | Q3_K_S | 5.22GB | TBC / 34 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/images/output_test_comb_Q3_K_S_512_25_woman.png) |
| [flux1-dev-IQ4_XS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-IQ4_XS.gguf) | IQ4_XS | 6.42GB | TBC / 25 | - |
| [flux1-dev-Q4_0.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-Q4_0.gguf) | Q4_0 | 6.79GB | TBC / 31 | - |
| [flux1-dev-IQ4_NL.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-IQ4_NL.gguf) | IQ4_NL | 6.79GB | TBC / 21 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/images/output_test_comb_IQ4_NL_512_25_woman.png) |
| [flux1-dev-Q4_K_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-Q4_K_S.gguf) | Q4_K_S | 6.79GB | TBC / 29 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/images/output_test_comb_Q4_K_S_512_25_woman.png) |
| [flux1-dev-Q4_1.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-Q4_1.gguf) | Q4_1 | 7.53GB | TBC / 24 | - |
| [flux1-dev-Q5_0.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-Q5_0.gguf) | Q5_0 | 8.27GB | TBC / 25 | - |
| flux1-dev-Q5_1.gguf | Q5_1 | TBC | TBC / 24 | - |
| [flux1-dev-Q5_K_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-Q5_K_S.gguf) | Q5_K_S | 8.27GB | TBC / 20 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/images/output_test_comb_Q5_K_S_512_25_woman.png) |
| [flux1-dev-Q6_K.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/flux1-dev-Q6_K.gguf) | Q6_K | 9.84GB | TBC / 19 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-f16-combined/images/output_test_comb_Q6_K_512_25_woman.png) |
| flux1-dev-Q8_0.gguf | Q8_0 | - | TBC / 10 | - |
| - | F16 | 23.8GB | reference | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/images/output_test_F16_512_25_woman.png) |
## Observations
- Bravo IQ1_S worse than Alpha?
- [Latent loss](https://huggingface.co/Eviation/flux-imatrix/blob/main/images/latent_loss.png)
- [Per layer quantization cost](https://huggingface.co/Eviation/flux-imatrix/blob/main/images/casting_cost.png) from [chrisgoringe/casting_cost](https://github.com/chrisgoringe/mixed-gguf-converter/blob/main/costs/casting_cost.yaml)
- Per layer quantization cost 2 from [Freepik/flux.1-lite-8B](https://huggingface.co/Freepik/flux.1-lite-8B): [double blocks](https://huggingface.co/Freepik/flux.1-lite-8B/blob/main/sample_images/mse_mmdit_img.png) and [single blocks](https://huggingface.co/Freepik/flux.1-lite-8B/blob/main/sample_images/mse_dit_img.png)
- [Ablation latent loss per weight type](https://huggingface.co/Eviation/flux-imatrix/blob/main/images/latent_loss_ablation.png)
- [Pareto front loss vs. size](https://huggingface.co/Eviation/flux-imatrix/blob/main/images/latent_loss_size_pareto.png)
# Alpha
Simple imatrix: 512x512 single image 8/20 steps [city96/flux1-dev-Q3_K_S](https://huggingface.co/city96/FLUX.1-dev-gguf/blob/main/flux1-dev-Q3_K_S.gguf) euler
data: `load_imatrix: loaded 314 importance matrix entries from imatrix.dat computed on 7 chunks`.
Using [llama.cpp quantize cae9fb4](https://github.com/ggerganov/llama.cpp/commit/cae9fb4361138b937464524eed907328731b81f6) with modified [lcpp.patch](https://github.com/city96/ComfyUI-GGUF/blob/main/tools/lcpp.patch).
## Experimental from q8
| Filename | Quant type | File Size | Description / L2 Loss Step 25 | Example Image |
| -------- | ---------- | --------- | ----------------------------- | ------------- |
| [flux1-dev-IQ1_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-IQ1_S.gguf) | IQ1_S | 2.45GB | worst / 152 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/images/output_test_IQ1_S_512_25_woman.png) |
| - | IQ1_M | - | broken | - |
| [flux1-dev-TQ1_0.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-TQ1_0.gguf) | TQ1_0| 2.63GB | TBC / 220 | - |
| [flux1-dev-TQ2_0.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-TQ2_0.gguf) | TQ2_0 | 3.19GB | TBC / 220 | - |
| [flux1-dev-IQ2_XXS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-IQ2_XXS.gguf) | IQ2_XXS | 3.19GB | worst / 130 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/images/output_test_IQ2_XXS_512_25_woman.png) |
| [flux1-dev-IQ2_XS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-IQ2_XS.gguf) | IQ2_XS | 3.56GB | worst / 129 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/images/output_test_IQ2_XS_512_25_woman.png) |
| [flux1-dev-IQ2_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-IQ2_S.gguf) | IQ2_S | 3.56GB | worst / 129 | - |
| [flux1-dev-IQ2_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-IQ2_M.gguf) | IQ2_M | 3.93GB | worst / 121 | - |
| [flux1-dev-Q2_K.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-Q2_K.gguf) | Q2_K | 4.02GB | TBC / 77 | - |
| [flux1-dev-Q2_K_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-Q2_K_S.gguf) | Q2_K_S | 4.02GB | ok / 77 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/images/output_test_Q2_K_S_512_25_woman.png) |
| [flux1-dev-IQ3_XXS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-IQ3_XXS.gguf) | IQ3_XXS | 4.66GB | TBC / 130 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/images/output_test_IQ3_XXS_512_25_woman.png) |
| [flux1-dev-IQ3_XS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-IQ3_XS.gguf) | IQ3_XS | 5.22GB | TBC / 114 | - |
| [flux1-dev-IQ3_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-IQ3_S.gguf) | IQ3_S | 5.22GB | TBC / 114 | - |
| [flux1-dev-IQ3_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-IQ3_M.gguf) | IQ3_M | 5.22GB | TBC / 114 | - |
| [flux1-dev-Q3_K_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-Q3_K_S.gguf) | Q3_K_S | 5.22GB | TBC / 36 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/images/output_test_Q3_K_S_512_25_woman.png) |
| [flux1-dev-Q3_K_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-Q3_K_K.gguf) | Q3_K_M | 5.36GB | TBC / 42 | - |
| [flux1-dev-Q3_K_L.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-Q3_K_L.gguf) | Q3_K_L | 5.36GB | TBC / 42 | - |
| [flux1-dev-IQ4_XS.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-IQ4_XS.gguf) | IQ4_XS | 6.42GB | TBC / 30 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/images/output_test_IQ4_XS_512_25_woman.png) |
| [flux1-dev-IQ4_NL.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-IQ4_NL.gguf) | IQ4_NL | 6.79GB | TBC / 23 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/images/output_test_IQ4_NL_512_25_woman.png) |
| [flux1-dev-Q4_0.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-Q4_0.gguf) | Q4_0 | 6.79GB | TBC / 27 | - |
| - | Q4_K | TBC | TBC / 27 | - |
| [flux1-dev-Q4_K_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-Q4_K_S.gguf) | Q4_K_S | 6.79GB | TBC / 26 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/images/output_test_Q4_K_S_512_25_woman.png) |
| [flux1-dev-Q4_K_M.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-Q4_K_M.gguf) | Q4_K_M | 6.93GB | TBC / 27 | - |
| [flux1-dev-Q4_1.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-Q4_1.gguf) | Q4_1 | 7.53GB | TBC / 23 | - |
| [flux1-dev-Q5_K_S.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-Q5_K_S.gguf) | Q5_K_S | 8.27GB | TBC / 19 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/images/output_test_Q5_K_S_512_25_woman.png) |
| [flux1-dev-Q5_K.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-Q5_K.gguf) | Q5_K | 8.41GB | TBC / 20 | - |
| - | Q5_K_M | TBC | TBC | - |
| [flux1-dev-Q6_K.gguf](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/flux1-dev-Q6_K.gguf) | Q6_K | 9.84GB | TBC / 22 | - |
| - | Q8_0 | 12.7GB | near perfect / 10 | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/images/output_test_Q8_512_25_woman.png) |
| - | F16 | 23.8GB | reference | [Example](https://huggingface.co/Eviation/flux-imatrix/blob/main/experimental-from-q8/images/output_test_F16_512_25_woman.png) |
## Observations
Sub-quants not diferentiated as expected: IQ2_XS == IQ2_S, IQ3_XS == IQ3_S == IQ3_M, Q3_K_M == Q3_K_L.
- Check if [lcpp_sd3.patch](https://github.com/city96/ComfyUI-GGUF/blob/main/tools/lcpp_sd3.patch) includes more specific quant level logic
- Extrapolate the existing level logic
|