Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
neuralmagic
's Collections
DeepSeek-R1-Distill Quantized
Granite 3.1 Quantization
Sparse-Llama-3.1-2of4
Vision Language Models Quantization
FP8 LLMs for vLLM
Llama-3.2 Quantization
Llama-3.1 Quantization
INT8 LLMs for vLLM
INT4 LLMs for vLLM
Sparse Foundational Llama 2 Models
Compression Papers
DeepSparse Sparse LLMs
Sparse Finetuning MPT
Compressed LLMs from the Community
Granite 3.1 Quantization
updated
Jan 24
Upvote
-
RedHatAI/granite-3.1-2b-instruct-quantized.w4a16
Text Generation
•
0.5B
•
Updated
Feb 28
•
134
RedHatAI/granite-3.1-2b-instruct-quantized.w8a8
Text Generation
•
3B
•
Updated
Feb 28
•
19
RedHatAI/granite-3.1-8b-instruct-quantized.w4a16
Text Generation
•
1B
•
Updated
Sep 22
•
462
•
1
RedHatAI/granite-3.1-8b-instruct-quantized.w8a8
Text Generation
•
8B
•
Updated
Sep 25
•
132
•
2
RedHatAI/granite-3.1-2b-instruct-FP8-dynamic
Text Generation
•
3B
•
Updated
Jan 28
•
35
RedHatAI/granite-3.1-8b-instruct-FP8-dynamic
Text Generation
•
8B
•
Updated
Sep 22
•
53
•
1
RedHatAI/granite-3.1-2b-base-quantized.w4a16
Text Generation
•
0.5B
•
Updated
Feb 28
•
8
RedHatAI/granite-3.1-2b-base-quantized.w8a8
Text Generation
•
3B
•
Updated
Feb 28
•
16
RedHatAI/granite-3.1-8b-base-FP8-dynamic
Text Generation
•
8B
•
Updated
Feb 20
•
1
RedHatAI/granite-3.1-2b-base-FP8-dynamic
Text Generation
•
3B
•
Updated
Jan 30
•
3
RedHatAI/granite-3.1-8b-base-quantized.w4a16
Text Generation
•
1B
•
Updated
Sep 22
•
17
•
1
RedHatAI/granite-3.1-8b-base-quantized.w8a8
Text Generation
•
8B
•
Updated
Feb 28
•
20
Upvote
-
Share collection
View history
Collection guide
Browse collections