Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,32 @@
|
|
| 1 |
---
|
| 2 |
license: other
|
|
|
|
| 3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
license: other
|
| 3 |
+
inference: false
|
| 4 |
---
|
| 5 |
+
|
| 6 |
+
# WizardLM: An Instruction-following LLM Using Evol-Instruct
|
| 7 |
+
|
| 8 |
+
These files are the result of merging the [delta weights](https://huggingface.co/victor123/WizardLM) with the original Llama7B model.
|
| 9 |
+
|
| 10 |
+
The code for merging is provided in the [WizardLM official Github repo](https://github.com/nlpxucan/WizardLM).
|
| 11 |
+
|
| 12 |
+
## WizardLM-7B GGML
|
| 13 |
+
|
| 14 |
+
This repo contains GGML files for WizardLM-7B for CPU inference
|
| 15 |
+
|
| 16 |
+
## Provided files
|
| 17 |
+
| Name | Quant method | Bits | Size | RAM required | Use case |
|
| 18 |
+
| ---- | ---- | ---- | ---- | ---- | ----- |
|
| 19 |
+
`WizardLM-7B.GGML.q4_0.bin` | q4_0 | 4bit | 39GB | 41GB | Superseded and not recommended |
|
| 20 |
+
`WizardLM-7B.GGML.q4_2.bin` | q4_2 | 4bit | 39GB | 41GB | Best compromise between resources, speed and quality |
|
| 21 |
+
`WizardLM-7B.GGML.q4_3.bin` | q4_3 | 4bit | 47GB | 49GB | Maximum quality, high RAM requirements and slow inference |
|
| 22 |
+
|
| 23 |
+
* The q4_0 file is provided for compatibility with older versions of llama.cpp. It has been superseded and is no longer recommended.
|
| 24 |
+
* The q4_2 file offers the best combination of performance and quality.
|
| 25 |
+
* The q4_3 file offers the highest quality, at the cost of increased RAM usage and slower inference speed.
|
| 26 |
+
|
| 27 |
+
# Original model info
|
| 28 |
+
|
| 29 |
+
Overview of Evol-Instruct
|
| 30 |
+
Evol-Instruct is a novel method using LLMs instead of humans to automatically mass-produce open-domain instructions of various difficulty levels and skills range, to improve the performance of LLMs.
|
| 31 |
+
|
| 32 |
+

|