Update inference-cache-config/trn1/mixtral.json 8343560 verified dacorvo HF Staff commited on 16 days ago
Update inference-cache-config/trn1/mixtral.json e64396b verified dacorvo HF Staff commited on 16 days ago
Delete inference-cache-config/mistral-variants.json 3551ea0 verified dacorvo HF Staff commited on 17 days ago
Update inference-cache-config/llama-variants.json a510ca8 verified dacorvo HF Staff commited on 24 days ago
Rename inference-cache-config/qwen3-moe.json to inference-cache-config/qwen-moe.json 24ae643 verified dacorvo HF Staff commited on Sep 2
Add batch size 4 configurations for LLama 1B and 3B models 3b6312a verified dacorvo HF Staff commited on Jun 25
Rename inference-cache-config/pixart_sigma_xl_512x512.json to inference-cache-config/pixart-sigma-xl-512x512.json 1d662ce verified Jingya HF Staff commited on Jun 22
Rename inference-cache-config/pixart-xl-2-512x512.json to inference-cache-config/pixart-alpha-xl-512x512.json cb11624 verified Jingya HF Staff commited on Jun 22
Rename inference-cache-config/pixArt-XL-2-512x512.json to inference-cache-config/pixart-xl-2-512x512.json c7f992d verified Jingya HF Staff commited on Jun 22
Rename inference-cache-config/diffusion.json to inference-cache-config/stable-diffusion-v1-5.json 4a034bb verified Jingya HF Staff commited on Jun 22
Update inference-cache-config/qwen2.5-large.json 84982b8 verified dacorvo HF Staff commited on Jan 28
Rename inference-cache-config/qwen-2.5-large.json to inference-cache-config/qwen2.5-large.json 2aa52ac verified dacorvo HF Staff commited on Dec 4, 2024
Rename inference-cache-config/qwen2.5 to inference-cache-config/qwen2.5.json b9f1fde verified dacorvo HF Staff commited on Dec 4, 2024
Add qwen2.5 config for models up to 14B params 4e25bb0 verified dacorvo HF Staff commited on Dec 4, 2024