amd/Llama-2-70b-chat-hf-WMXFP4FP8-AMXFP4FP8-AMP-KVFP8
Safetensors
llama
quark
License: llama2
main
55.2 GB
2 contributors
History: 5 commits
XuebinWang · copy needed files from original meta-llama (#4) · d35b802 · verified · about 2 months ago
| File | Size | Badges | Last commit | Updated |
|---|---|---|---|---|
| .gitattributes | 1.58 kB | Safe | copy needed files from original meta-llama (#4) | about 2 months ago |
| LICENSE.txt | 7.02 kB | Safe | copy needed files from original meta-llama (#4) | about 2 months ago |
| MODEL_CARD.md | 7.23 kB | Safe | copy needed files from original meta-llama (#4) | about 2 months ago |
| NOTICE.txt | 135 Bytes | | copy needed files from original meta-llama (#4) | about 2 months ago |
| README.md | 2.77 kB | | upload readme file (#3) | about 2 months ago |
| Responsible-Use-Guide.pdf | 1.25 MB | Safe, xet | copy needed files from original meta-llama (#4) | about 2 months ago |
| USE_POLICY.md | 4.77 kB | Safe | copy needed files from original meta-llama (#4) | about 2 months ago |
| chat_template.jinja | 815 Bytes | Safe | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| config.json | 929 kB | | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| generation_config.json | 183 Bytes | Safe | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| model-00001-of-00012.safetensors | 4.98 GB | xet | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| model-00002-of-00012.safetensors | 4.92 GB | xet | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| model-00003-of-00012.safetensors | 4.96 GB | xet | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| model-00004-of-00012.safetensors | 4.93 GB | xet | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| model-00005-of-00012.safetensors | 4.8 GB | xet | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| model-00006-of-00012.safetensors | 5 GB | xet | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| model-00007-of-00012.safetensors | 4.98 GB | xet | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| model-00008-of-00012.safetensors | 4.98 GB | xet | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| model-00009-of-00012.safetensors | 5 GB | xet | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| model-00010-of-00012.safetensors | 5 GB | xet | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| model-00011-of-00012.safetensors | 4.83 GB | xet | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| model-00012-of-00012.safetensors | 759 MB | xet | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| model.safetensors.index.json | 149 kB | | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| special_tokens_map.json | 414 Bytes | Safe | copy needed files from original meta-llama (#4) | about 2 months ago |
| tokenizer.json | 3.62 MB | Safe | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
| tokenizer.model | 500 kB | Safe, xet | upload (#1) | 3 months ago |
| tokenizer_config.json | 977 Bytes | Safe | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago |
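The listing pairs twelve `model-000NN-of-00012.safetensors` shards with a `model.safetensors.index.json` file. In the usual Hugging Face sharded-checkpoint layout, that index holds a `weight_map` from tensor name to shard file name. The sketch below assumes this repository follows that layout; it groups tensor names by shard using a tiny hand-written stand-in for the index. The tensor names and the `tensors_per_shard` helper are illustrative assumptions, not values read from this repository.

```python
from collections import defaultdict


def tensors_per_shard(index: dict) -> dict[str, list[str]]:
    """Group tensor names by the shard file that stores them."""
    shards: dict[str, list[str]] = defaultdict(list)
    for tensor_name, shard_file in index["weight_map"].items():
        shards[shard_file].append(tensor_name)
    # Sort shards and tensor names so the result is deterministic.
    return {name: sorted(tensors) for name, tensors in sorted(shards.items())}


# Hand-written stand-in for model.safetensors.index.json.
# Tensor names here are hypothetical; the real file lists every tensor.
example_index = {
    "metadata": {"total_size": 0},
    "weight_map": {
        "model.embed_tokens.weight": "model-00001-of-00012.safetensors",
        "model.layers.0.self_attn.q_proj.weight": "model-00001-of-00012.safetensors",
        "lm_head.weight": "model-00012-of-00012.safetensors",
    },
}

grouped = tensors_per_shard(example_index)
for shard, names in grouped.items():
    print(shard, len(names))
```

With the real index loaded through `json.load`, the same function would show which shards need to be downloaded to inspect a given layer.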