Serve With vLLM

#1
by faheemraza1 - opened

Can someone share the command to serve this model on an RTX 3090?

NVFP4 is only supported on Nvidia 50 series GPU.

Sign up or log in to comment