amd/Llama-2-70b-chat-hf-WMXFP4FP8-AMXFP4FP8-AMP-KVFP8

55.2 GB

2 contributors

History: 5 commits

Latest commit: copy needed files from original meta-llama (#4), by XuebinWang, d35b802 (verified), about 2 months ago

File | Size | Last commit | Updated
.gitattributes | 1.58 kB | copy needed files from original meta-llama (#4) | about 2 months ago
LICENSE.txt | 7.02 kB | copy needed files from original meta-llama (#4) | about 2 months ago
MODEL_CARD.md | 7.23 kB | copy needed files from original meta-llama (#4) | about 2 months ago
NOTICE.txt | 135 Bytes | copy needed files from original meta-llama (#4) | about 2 months ago
README.md | 2.77 kB | upload readme file (#3) | about 2 months ago
Responsible-Use-Guide.pdf | 1.25 MB (xet) | copy needed files from original meta-llama (#4) | about 2 months ago
USE_POLICY.md | 4.77 kB | copy needed files from original meta-llama (#4) | about 2 months ago
chat_template.jinja | 815 Bytes | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
config.json | 929 kB | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
generation_config.json | 183 Bytes | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
model-00001-of-00012.safetensors | 4.98 GB (xet) | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
model-00002-of-00012.safetensors | 4.92 GB (xet) | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
model-00003-of-00012.safetensors | 4.96 GB (xet) | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
model-00004-of-00012.safetensors | 4.93 GB (xet) | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
model-00005-of-00012.safetensors | 4.8 GB (xet) | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
model-00006-of-00012.safetensors | 5 GB (xet) | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
model-00007-of-00012.safetensors | 4.98 GB (xet) | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
model-00008-of-00012.safetensors | 4.98 GB (xet) | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
model-00009-of-00012.safetensors | 5 GB (xet) | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
model-00010-of-00012.safetensors | 5 GB (xet) | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
model-00011-of-00012.safetensors | 4.83 GB (xet) | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
model-00012-of-00012.safetensors | 759 MB (xet) | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
model.safetensors.index.json | 149 kB | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
special_tokens_map.json | 414 Bytes | copy needed files from original meta-llama (#4) | about 2 months ago
tokenizer.json | 3.62 MB | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago
tokenizer.model | 500 kB (xet) | upload (#1) | 3 months ago
tokenizer_config.json | 977 Bytes | update Quark quantized Auto Mixed Precision (AMP) Llama-2-70b-chat-hf model with better accuracies (#2) | about 2 months ago