Update README.md
Browse files
README.md
CHANGED
|
@@ -11,7 +11,7 @@ pipeline_tag: text-generation
|
|
| 11 |
- upload **in progress**/~done~ Tracker
|
| 12 |
- split bf16 parts: ~1~ ~2~ ~3~ **4**
|
| 13 |
- quants:
|
| 14 |
-
- ~IQ4_XS~ **Q4_K_M** Q5_K_M Q6_K
|
| 15 |
- Based on [ngxson/llama.cpp/pull/26](https://github.com/ngxson/llama.cpp/pull/26)@[46c8b70cbc7346db95e45ebae4f1e0c68a9b8d86](https://github.com/ngxson/llama.cpp/pull/26/commits/46c8b70cbc7346db95e45ebae4f1e0c68a9b8d86)
|
| 16 |
- which is based on [ggml-org/llama.cpp/pull/14425](https://github.com/ggml-org/llama.cpp/pull/14425)
|
| 17 |
- supposedly works *mostly fine*™ when run with below args according to [ggml-org/llama.cpp/pull/14425#issuecomment-3017533726](https://github.com/ggml-org/llama.cpp/pull/14425#issuecomment-3017533726)
|
|
|
|
| 11 |
- upload **in progress**/~done~ Tracker
|
| 12 |
- split bf16 parts: ~1~ ~2~ ~3~ **4**
|
| 13 |
- quants:
|
| 14 |
+
- ~IQ4_XS~ **Q4_K_M** Q5_K_M Q6_K Q8_0 Q4_0 Q4_K_S Q5_K_S IQ2_XXS IQ3_XXS
|
| 15 |
- Based on [ngxson/llama.cpp/pull/26](https://github.com/ngxson/llama.cpp/pull/26)@[46c8b70cbc7346db95e45ebae4f1e0c68a9b8d86](https://github.com/ngxson/llama.cpp/pull/26/commits/46c8b70cbc7346db95e45ebae4f1e0c68a9b8d86)
|
| 16 |
- which is based on [ggml-org/llama.cpp/pull/14425](https://github.com/ggml-org/llama.cpp/pull/14425)
|
| 17 |
- supposedly works *mostly fine*™ when run with below args according to [ggml-org/llama.cpp/pull/14425#issuecomment-3017533726](https://github.com/ggml-org/llama.cpp/pull/14425#issuecomment-3017533726)
|