optimum-neuron-cache / inference-cache-config
19.5 kB
dacorvo HF Staff
Update inference-cache-config/qwen-moe.json
51619c0 verified