Unlike dense models, a MOE model can realistically be run locally on a CPU and GPU.
Β· Sign up or log in to comment