Update README.md

README.md CHANGED
```diff
@@ -13,6 +13,11 @@ The code for merging is provided in the [WizardLM official Github repo](https://
 
 This repo contains GGML files for CPU inference using [llama.cpp](https://github.com/ggerganov/llama.cpp).
 
+## Other repositories available
+
+* [4bit GPTQ models for GPU inference](https://huggingface.co/TheBloke/wizardLM-7B-GPTQ)
+* [Unquantised model in HF format](https://huggingface.co/TheBloke/wizardLM-7B-HF)
+
 ## Provided files
 
 | Name | Quant method | Bits | Size | RAM required | Use case |
 | ---- | ---- | ---- | ---- | ---- | ----- |
```