Commit History

Update inference-cache-config/llama.json
325c041
verified

dacorvo HF Staff commited on

Add batch size 4 configurations for LLama 1B and 3B models
3b6312a
verified

dacorvo HF Staff commited on

Added TinyLlama as requested by Jim burtoft
d9640f4
verified

dacorvo HF Staff commited on

Update inference-cache-config/llama.json
d05f579
verified

dacorvo HF Staff commited on

Update inference-cache-config/llama.json
0548cd2
verified

dacorvo HF Staff commited on

Update inference-cache-config/llama.json
afb9fe6
verified

dacorvo HF Staff commited on

Rename inference-cache-config/llama-3.1-8B.json to inference-cache-config/llama.json
14844a0
verified

dacorvo HF Staff commited on

Rename inference-cache-config/llama.json to inference-cache-config/llama2.json
f06a55a
verified

dacorvo HF Staff commited on

Add more llama config
2d87237
verified

dacorvo HF Staff commited on

Added Llama-70b batch_size 4 to inference cache
593822e
verified

dacorvo HF Staff commited on

Create inference-cache-config/llama.json
1960ccb
verified

philschmid commited on