NVFP4
/

Qwen3-30B-A3B-Instruct-2507-FP4

Text Generation

Model Optimizer

8-bit precision

Model card Files Files and versions

how was this model created

#1

by koushd - opened Sep 15

koushd

Sep 15

Did you use llm-compressor in vllm or something else?

prudant

20 days ago

how do you run this model? cant with vllm / sglang

koushd

20 days ago

moe nvfp4 only seems to work with tensorrt-llm

prudant

6 days ago

can you share the quant script for this model please 🙏

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment