-
-
-
-
-
-
Inference Providers
Active filters:
awq
UCLA-EMC/Meta-Llama-3.1-8B-Instruct-AWQ-INT4-32-2.17B
Text Generation
•
8B
•
Updated
•
5
•
1
Skhaled99/SeaLLM-v3-7B-Chat-AWQ
8B
•
Updated
•
2
reach-vb/Meta-Llama-3.1-8B-Instruct-AWQ-INT4-fix
Text Generation
•
8B
•
Updated
•
8
VAGOsolutions/Llama-3.1-SauerkrautLM-8b-Instruct-awq
8B
•
Updated
•
45
•
3
phamtungthuy/Llama-3-8B-Instruct-AWQ
8B
•
Updated
•
4
solidrust/dolphin-2.9.4-llama3.1-8b-AWQ
Text Generation
•
8B
•
Updated
•
57
•
3
magnifi/parser_user_v18a_epoch_7_lr_0p002_awq
4B
•
Updated
•
4
sergeipetrov/InternVL2-8B-AWQ-qconfig
Image-Text-to-Text
•
Updated
•
6
kamalgodar/TinyLlama-1.1B-Chat-v1.0-AWQ
1B
•
Updated
•
1
abhinav-2k23/RAG-Llama-3.1-8B-AWQ-4bit_v4
8B
•
Updated
•
3
Weni/Llama-Guard-3-8B-AWQ
8B
•
Updated
•
74
•
1
PrunaAI/ibm-granite-granite-8b-code-instruct-128k-AWQ-4bit-smashed
8B
•
Updated
•
7
Xu-Ouyang/Qwen2-0.5B-int4-AWQ
Text Generation
•
0.5B
•
Updated
•
8
informatiker/Hermes-3-Llama-3.1-8B-AWQ-INT4
8B
•
Updated
•
32
sensorqa/sensist_gpt_shortened_lora_unloaded_awq
7B
•
Updated
•
6
sensorqa/sensist_choices_lora_unloaded_awq
7B
•
Updated
•
4
phamvanlinh143/Qwen2-1.5B-Instruct-AWQ
Text Generation
•
2B
•
Updated
•
2
phamvanlinh143/opt-125m-awq
Text Generation
•
0.2B
•
Updated
•
3
arcee-ai/deepseek-v2-chat-0628-awq
236B
•
Updated
•
54
•
6
TechxGenus/SmolLM-135M-Instruct-AWQ
Text Generation
•
0.2B
•
Updated
•
3
ghost-x/ghost-8b-beta-1608-awq
Text Generation
•
8B
•
Updated
•
12
hiuman/llama-3.1-8B-intruct-awq-quantization
8B
•
Updated
•
2
harborwater/Gemma-2-9B-It-SPPO-Iter3-AWQ
Text Generation
•
10B
•
Updated
•
4
denemeacc/Llama-2-7b-llm-awq-4bit
denemeacc/Llama-2-7b-chat-hf-llm-awq
magnifi/parser_user_v19a_epoch_7_lr_0p002_awq
4B
•
Updated
•
2
magnifi/Phi3_intent_v31_2_epoch_2_lr_0p002_awq
4B
•
Updated
•
5
jester6136/Phi-3.5-mini-instruct-awq
4B
•
Updated
•
7
jburmeister/Meta-Llama-3.1-8B-Instruct-AWQ-INT4
Text Generation
•
8B
•
Updated
•
2