Update README.md
README.md
CHANGED
@@ -30,6 +30,12 @@ data: `load_imatrix: loaded 314 importance matrix entries from imatrix_caesar.da
Using [llama.cpp quantize cae9fb4](https://github.com/ggerganov/llama.cpp/commit/cae9fb4361138b937464524eed907328731b81f6) with modified [lcpp.patch](https://github.com/city96/ComfyUI-GGUF/blob/main/tools/lcpp.patch).
+Dynamic quantization:
+- img_in, guidance_in.in_layer, final_layer.linear: f32/bf16/f16
+- guidance_in, final_layer: bf16/f16
+- img_attn.qkv, linear1: two bits up
+- txt_mod.lin, txt_mlp, txt_attn.proj: one bit down
+
## Experimental from f16
| Filename | Quant type | File Size | Description | Example Image |
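
As a rough illustration of the dynamic quantization rules added in the diff above, the sketch below shows one way name-based per-tensor overrides could be expressed. It is not the actual lcpp.patch or llama.cpp code: the `QuantLevel` ladder, `shift`, and `pick_type` helpers are invented stand-ins, and reading "two bits up" / "one bit down" as steps along an ordered list of quant types is an assumption.

```cpp
// Sketch only: name-based per-tensor quant rules similar in spirit to the
// README list above. The enum and helpers are stand-ins, not ggml/llama.cpp types.
#include <cstdio>
#include <string>
#include <vector>

// Stand-in for an ordered ladder of quant "bit levels".
enum class QuantLevel { Q2_K, Q3_K, Q4_K, Q5_K, Q6_K, Q8_0, F16 };

static const char *level_name(QuantLevel q) {
    switch (q) {
        case QuantLevel::Q2_K: return "Q2_K";
        case QuantLevel::Q3_K: return "Q3_K";
        case QuantLevel::Q4_K: return "Q4_K";
        case QuantLevel::Q5_K: return "Q5_K";
        case QuantLevel::Q6_K: return "Q6_K";
        case QuantLevel::Q8_0: return "Q8_0";
        case QuantLevel::F16:  return "F16";
    }
    return "?";
}

// Move `steps` levels up (positive) or down (negative) the ladder, clamped.
static QuantLevel shift(QuantLevel base, int steps) {
    int v = static_cast<int>(base) + steps;
    if (v < 0) v = 0;
    if (v > static_cast<int>(QuantLevel::F16)) v = static_cast<int>(QuantLevel::F16);
    return static_cast<QuantLevel>(v);
}

static bool contains(const std::string &name, const char *pat) {
    return name.find(pat) != std::string::npos;
}

// Pick a per-tensor type from the tensor name and the base quant type.
static QuantLevel pick_type(const std::string &name, QuantLevel base) {
    if (contains(name, "img_in") || contains(name, "guidance_in.in_layer") ||
        contains(name, "final_layer.linear"))
        return QuantLevel::F16;   // kept in high precision (f32/bf16/f16 in the README)
    if (contains(name, "guidance_in") || contains(name, "final_layer"))
        return QuantLevel::F16;   // bf16/f16 in the README
    if (contains(name, "img_attn.qkv") || contains(name, "linear1"))
        return shift(base, +2);   // "two bits up"
    if (contains(name, "txt_mod.lin") || contains(name, "txt_mlp") ||
        contains(name, "txt_attn.proj"))
        return shift(base, -1);   // "one bit down"
    return base;                  // everything else: base quant type
}

int main() {
    const QuantLevel base = QuantLevel::Q4_K;
    // Hypothetical tensor names, for demonstration only.
    std::vector<std::string> tensors = {
        "img_in.weight", "double_blocks.0.img_attn.qkv.weight",
        "double_blocks.0.txt_mlp.0.weight", "single_blocks.3.linear1.weight",
        "double_blocks.1.img_mlp.0.weight",
    };
    for (const auto &t : tensors)
        std::printf("%-40s -> %s\n", t.c_str(), level_name(pick_type(t, base)));
}
```

The more specific patterns (e.g. `guidance_in.in_layer`) are checked before the broader ones (`guidance_in`) so that substring matching does not shadow the intended rule.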