roleplaiapp
/

DS-Distilled-Hermes-Llama-3.1_TIES-i1-IQ4_XS-GGUF

Text Generation

Model card Files Files and versions

DS-Distilled-Hermes-Llama-3.1_TIES-i1-IQ4_XS-GGUF / README.md

roleplaiapp's picture

Upload README.md with huggingface_hub

c7019fd verified 10 months ago

|

history blame contribute delete

813 Bytes

	---
	library_name: transformers
	pipeline_tag: text-generation
	tags:
	- IQ4_XS
	- distilled
	- gguf
	- hermes
	- iq4
	- llama
	- llama-cpp
	- text-generation
	- ties
	---

	# roleplaiapp/DS-Distilled-Hermes-Llama-3.1_TIES-i1-IQ4_XS-GGUF

	Repo: `roleplaiapp/DS-Distilled-Hermes-Llama-3.1_TIES-i1-IQ4_XS-GGUF`
	Original Model: `DS-Distilled-Hermes-Llama-3.1_TIES-i1`
	Quantized File: `DS-Distilled-Hermes-Llama-3.1_TIES.i1-IQ4_XS.gguf`
	Quantization: `GGUF`
	Quantization Method: `IQ4_XS`

	## Overview
	This is a GGUF IQ4_XS quantized version of DS-Distilled-Hermes-Llama-3.1_TIES-i1
	## Quantization By
	I often have idle GPUs while building/testing for the RP app, so I put them to use quantizing models.
	I hope the community finds these quantizations useful.

	Andrew Webby @ [RolePlai](https://roleplai.app/).