| library_name: transformers | |
| pipeline_tag: text-generation | |
| tags: | |
| - IQ4_XS | |
| - distilled | |
| - gguf | |
| - hermes | |
| - iq4 | |
| - llama | |
| - llama-cpp | |
| - text-generation | |
| - ties | |
| # roleplaiapp/DS-Distilled-Hermes-Llama-3.1_TIES-i1-IQ4_XS-GGUF | |
| **Repo:** `roleplaiapp/DS-Distilled-Hermes-Llama-3.1_TIES-i1-IQ4_XS-GGUF` | |
| **Original Model:** `DS-Distilled-Hermes-Llama-3.1_TIES-i1` | |
| **Quantized File:** `DS-Distilled-Hermes-Llama-3.1_TIES.i1-IQ4_XS.gguf` | |
| **Quantization:** `GGUF` | |
| **Quantization Method:** `IQ4_XS` | |
| ## Overview | |
| This is a GGUF IQ4_XS quantized version of DS-Distilled-Hermes-Llama-3.1_TIES-i1 | |
| ## Quantization By | |
| I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models. | |
| I hope the community finds these quantizations useful. | |
| Andrew Webby @ [RolePlai](https://roleplai.app/). | |