WYF3634076 commited on
Commit
8002acc
·
verified ·
1 Parent(s): f7b72c0

Update README.md

Browse files

vllm model card update

Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -109,12 +109,12 @@ print("result:", result)
109
 
110
  ```bash
111
  # 80G * 16 GPU
112
- vllm serve baidu/ERNIE-4.5-300B-A47B-Base-PT --trust-remote-code
113
  ```
114
 
115
  ```bash
116
- # FP8 online quantification 80G * 16 GPU
117
- vllm serve baidu/ERNIE-4.5-300B-A47B-Base-PT --trust-remote-code --quantization fp8
118
  ```
119
 
120
  ## License
 
109
 
110
  ```bash
111
  # 80G * 16 GPU
112
+ vllm serve baidu/ERNIE-4.5-300B-A47B-Base-PT --tensor-parallel-size 16
113
  ```
114
 
115
  ```bash
116
+ # FP8 online quantification 80G * 8 GPU
117
+ vllm serve baidu/ERNIE-4.5-300B-A47B-Base-PT --tensor-parallel-size 8 --quantization fp8
118
  ```
119
 
120
  ## License