Commit 81465c4 (verified) by macandchiz · 1 parent: fe41f65

Upload README.md with huggingface_hub

  1. README.md +48 -0
README.md ADDED
@@ -0,0 +1,48 @@
+ ---
+ library_name: transformers
+ license: apache-2.0
+ license_link: https://huggingface.co/NousResearch/Hermes-4-70B/blob/main/LICENSE
+ pipeline_tag: text-generation
+ tags:
+ - llama-cpp
+ - gguf-my-repo
+ base_model: NousResearch/Hermes-4-70B
+ ---
+
+ # macandchiz/Hermes-4-70B-GGUF
+
+ ![GGUF Logo](https://cdn-avatars.huggingface.co/v1/production/uploads/67ef3f648b0b5419e4c8ba8c/L3g7LETCBD9EMQEiWFvP0.png)
+
+ Quantized version of: [NousResearch/Hermes-4-70B](https://huggingface.co/NousResearch/Hermes-4-70B)
+
+ ## Available Files
+
+ The following GGUF quantization variants are available:
+
+ - `hermes-4-70b-q2_k.gguf`
+ - `hermes-4-70b-q3_k_s.gguf`
+ - `hermes-4-70b-q3_k_m.gguf`
+ - `hermes-4-70b-q3_k_l.gguf`
+ - `hermes-4-70b-q4_0.gguf`
+ - `hermes-4-70b-q4_1.gguf`
+ - `hermes-4-70b-q4_k_s.gguf`
+ - `hermes-4-70b-q4_k_m.gguf`
+ - `hermes-4-70b-q5_0.gguf`
+ - `hermes-4-70b-q5_1.gguf`
+ - `hermes-4-70b-q5_k_s.gguf`
+ - `hermes-4-70b-q5_k_m.gguf`
+ - `hermes-4-70b-q6_k.gguf`
+ - `hermes-4-70b-q8_0.gguf`
+ - `hermes-4-70b-f16.gguf`
+
+ ## Quantization Information
+
+ - **q2_k**: Smallest size, lowest quality
+ - **q3_k_s, q3_k_m, q3_k_l**: Small size, low quality
+ - **q4_0, q4_1, q4_k_s, q4_k_m**: Medium size, good quality (recommended for most use cases)
+ - **q5_0, q5_1, q5_k_s, q5_k_m**: Larger size, better quality
+ - **q6_k**: Large size, high quality
+ - **q8_0**: Very large size, very high quality
+ - **f16**: 16-bit floating point, not quantized (largest size)
+
+ Choose the quantization level that best fits your needs, trading file size against model quality.
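
The file names added in this README follow one pattern, so a quantization name can be turned into a file name and a download URL mechanically. The sketch below is illustrative only and not part of the commit: the helper names `gguf_filename` and `resolve_url` are hypothetical, and it assumes the files sit at the repository root on the `main` branch and are served through the Hub's usual `resolve` URL scheme.

```python
# Illustrative sketch (not part of the commit): map a quantization name from
# the README to its GGUF file name and a direct download URL.

# Repository name taken from the README heading.
REPO_ID = "macandchiz/Hermes-4-70B-GGUF"

# Quantization variants, in the order the README lists them.
QUANTS = [
    "q2_k", "q3_k_s", "q3_k_m", "q3_k_l",
    "q4_0", "q4_1", "q4_k_s", "q4_k_m",
    "q5_0", "q5_1", "q5_k_s", "q5_k_m",
    "q6_k", "q8_0", "f16",
]


def gguf_filename(quant: str) -> str:
    """Return the README's file name for a quantization such as 'q4_k_m'."""
    quant = quant.lower()
    if quant not in QUANTS:
        raise ValueError(f"unknown quantization {quant!r}; expected one of {QUANTS}")
    return f"hermes-4-70b-{quant}.gguf"


def resolve_url(quant: str, revision: str = "main") -> str:
    """Build a download URL, assuming the file is at the repo root (assumption)."""
    return f"https://huggingface.co/{REPO_ID}/resolve/{revision}/{gguf_filename(quant)}"


if __name__ == "__main__":
    # The README recommends the q4 family for most use cases.
    print(gguf_filename("q4_k_m"))
    print(resolve_url("q4_k_m"))
```

With the `huggingface_hub` package installed, the same file name can be passed to `hf_hub_download(repo_id=REPO_ID, filename=gguf_filename("q4_k_m"))`, which downloads the file and caches it locally. These files are large, so check available disk space first.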