Edit Models filters

Apps

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

196

Full-text search

Active filters: autoquant

Volko76/Qwen2.5-Coder-1.5B-Instruct-5.5bpw-exl2

Text Generation • Updated Nov 15, 2024

HoneyBadger2989/Llama-3-Groq-8B-Tool-Use-GGUF

Text Generation • 8B • Updated Nov 5, 2024 • 72 • 1

Volko76/Qwen2.5-Coder-0.5B-Instruct-GGUF

Text Generation • 0.5B • Updated Nov 11, 2024 • 44

Volko76/Qwen2.5-Coder-0.5B-Instruct-4.5bpw-exl2

Text Generation • Updated Nov 12, 2024

Volko76/Qwen2.5-Coder-0.5B-Instruct-1.0bpw-exl2

Text Generation • Updated Nov 12, 2024

Volko76/Qwen2.5-Coder-0.5B-Instruct-2.0bpw-exl2

Text Generation • Updated Nov 12, 2024

Volko76/Qwen2.5-Coder-0.5B-Instruct-3.0bpw-exl2

Text Generation • Updated Nov 12, 2024

Volko76/Qwen2.5-Coder-0.5B-Instruct-8.0bpw-exl2

Text Generation • Updated Nov 12, 2024 • 1

Volko76/Qwen2.5-Coder-3B-Instruct-GGUF

Text Generation • 3B • Updated Nov 12, 2024 • 55

Volko76/Qwen2.5-Coder-1.5B-Instruct-GGUF

Text Generation • 2B • Updated Nov 12, 2024 • 188

jgchaparro/language_garden-spa-tsd-8B-GGUF

8B • Updated Nov 15, 2024 • 27

Volko76/OpenCoder-1.5B-Instruct-GGUF

Text Generation • 2B • Updated Nov 18, 2024 • 78

Volko76/OpenCoder-1.5B-Base-GGUF

Text Generation • 2B • Updated Nov 18, 2024 • 82

Volko76/Qwen2.5-Coder-0.5B-GGUF

Text Generation • 0.5B • Updated Apr 27 • 167

Volko76/Qwen2.5-Coder-1.5B-GGUF

Text Generation • 2B • Updated Apr 27 • 190

saul95/Llama-3.2-1B-GGUF

Text Generation • 1B • Updated Nov 19, 2024 • 14

saul95/Llama-3.2-1B-4.5bpw-exl2

Text Generation • Updated Nov 19, 2024

saul95/Llama-3.2-1B-GPTQ

Text Generation • 0.4B • Updated Nov 19, 2024 • 87

Volko76/Qwen2.5-Coder-3B-GGUF

Text Generation • 3B • Updated Apr 27 • 78

Volko76/Qwen2.5-Coder-7B-GGUF

Text Generation • 8B • Updated Apr 27 • 107

Volko76/OpenCoder-8B-Instruct-GGUF

Text Generation • 8B • Updated Nov 22, 2024 • 20

doktorb/spydazweb_ai_humanai_010_chat.Q6_K.gguf

7B • Updated Dec 3, 2024 • 5

AMead10/Virtuoso-Small-AWQ

3B • Updated Dec 3, 2024 • 1

doktorb/daredevil-8b-abliterated.Q6_K.gguf

8B • Updated Dec 5, 2024 • 8

xiaojingyan/lora_model-GGUF

Updated Dec 7, 2024

xiaojingyan/lora_model-4.5bpw-exl2

Updated Dec 7, 2024

doktorb/Configurable-Llama-3.1-8B-Instruct-GGUF

8B • Updated Dec 10, 2024 • 7.61k

jgchaparro/language_garden-tsd-8B-GGUF

8B • Updated Jan 1 • 11

Crowno/L3.1-EtherealRainbow-v1.0-rc1-8B-6.5bpw-exl2

Text Generation • Updated Jan 10

Crowno/L3.1-EtherealRainbow-v1.0-rc1-8B-8.0bpw-exl2

Text Generation • Updated Jan 10 • 1