baidu
/

ERNIE-4.5-300B-A47B-Base-PT

Text Generation

Model card Files Files and versions

WYF3634076 commited on Sep 1

Commit

8002acc

·

verified ·

1 Parent(s): f7b72c0

Update README.md

vllm model card update

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -109,12 +109,12 @@ print("result:", result)
 ```bash
 # 80G * 16 GPU
-vllm serve baidu/ERNIE-4.5-300B-A47B-Base-PT --trust-remote-code
 ```
 ```bash
-# FP8 online quantification 80G * 16 GPU
-vllm serve baidu/ERNIE-4.5-300B-A47B-Base-PT --trust-remote-code --quantization fp8
 ```
 ## License

 ```bash
 # 80G * 16 GPU
+vllm serve baidu/ERNIE-4.5-300B-A47B-Base-PT --tensor-parallel-size 16
 ```
 ```bash
+# FP8 online quantification 80G * 8 GPU
+vllm serve baidu/ERNIE-4.5-300B-A47B-Base-PT --tensor-parallel-size 8 --quantization fp8
 ```
 ## License