PaddleOCR-VL-0.9B is now officially supported on vLLM
README.md CHANGED

@@ -72,9 +72,10 @@ PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vi
 
 
 ## News
-* ```2025.10.16``` 🚀 We release [PaddleOCR-VL](https://github.com/PaddlePaddle/PaddleOCR), a multilingual document-parsing solution built on a 0.9B ultra-compact vision-language model with SOTA performance.
-* ```2025.10.29``` PaddleOCR-VL-0.9B, the core module of PaddleOCR-VL, can now be called via the `transformers` library.
 
+* ```2025.11.04``` 🥳 PaddleOCR-VL-0.9B is now officially supported on `vLLM`.
+* ```2025.10.29``` PaddleOCR-VL-0.9B, the core module of PaddleOCR-VL, can now be called via the `transformers` library.
+* ```2025.10.16``` 🚀 We release [PaddleOCR-VL](https://github.com/PaddlePaddle/PaddleOCR), a multilingual document-parsing solution built on a 0.9B ultra-compact vision-language model with SOTA performance.
 
 ## Usage
 
@@ -113,15 +114,25 @@ for res in output:
 
 ### Accelerate VLM Inference via Optimized Inference Servers
 
-1. Start the VLM inference server
+1. Start the VLM inference server:
+
+   You can start the vLLM inference server in one of two ways:
+
+   - Method 1: via PaddleOCR
+
+     ```bash
+     docker run \
+         --rm \
+         --gpus all \
+         --network host \
+         ccr-2vdh3abv-pub.cnc.bj.baidubce.com/paddlepaddle/paddleocr-genai-vllm-server:latest \
+         paddleocr genai_server --model_name PaddleOCR-VL-0.9B --host 0.0.0.0 --port 8080 --backend vllm
+     ```
+
+   - Method 2: via vLLM
+
+     See the [vLLM: PaddleOCR-VL Usage Guide](https://docs.vllm.ai/projects/recipes/en/latest/PaddlePaddle/PaddleOCR-VL.html).
+
 2. Call the PaddleOCR CLI or Python API:
 
 ```bash

@@ -130,6 +141,7 @@ for res in output:
     --vl_rec_backend vllm-server \
     --vl_rec_server_url http://127.0.0.1:8080/v1
 ```
+
 ```python
 from paddleocr import PaddleOCRVL
 pipeline = PaddleOCRVL(vl_rec_backend="vllm-server", vl_rec_server_url="http://127.0.0.1:8080/v1")

@@ -346,4 +358,4 @@ If you find PaddleOCR-VL helpful, feel free to give us a star and citation.
   primaryClass={cs.CV},
   url={https://arxiv.org/abs/2510.14528},
 }
-```
+```
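
Before wiring the server into the pipeline, it can help to confirm it is reachable. Below is a minimal Python sketch, assuming the server exposes the OpenAI-compatible `/v1/models` route; that route is standard for vLLM-based servers but is not shown in this diff.

```python
# Sanity check: confirm the inference server started above is reachable.
# Assumption: the server exposes the OpenAI-compatible /v1/models route,
# which vLLM-based servers normally provide (not shown in this diff).
import json
import urllib.request

SERVER_URL = "http://127.0.0.1:8080/v1"  # same URL passed to --vl_rec_server_url

with urllib.request.urlopen(f"{SERVER_URL}/models", timeout=10) as resp:
    models = json.load(resp)

# PaddleOCR-VL-0.9B (the --model_name used when starting the server)
# should appear among the served models.
print([m.get("id") for m in models.get("data", [])])
```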
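With the server up, the client side reduces to the two lines added in the diff plus the prediction loop already in the README (the `for res in output:` hunk context). Below is a minimal end-to-end sketch: the input path is a placeholder, and the `predict`, `save_to_json`, and `save_to_markdown` calls are assumptions modeled on that loop rather than signatures confirmed by this diff.

```python
# End-to-end sketch: offload VL recognition to the vLLM server and parse a
# document. The PaddleOCRVL(...) arguments are taken verbatim from the diff;
# predict() and the save_to_* helpers are assumptions modeled on the
# `for res in output:` loop visible in the surrounding README context.
from paddleocr import PaddleOCRVL

pipeline = PaddleOCRVL(
    vl_rec_backend="vllm-server",
    vl_rec_server_url="http://127.0.0.1:8080/v1",
)

output = pipeline.predict("path/to/document.png")  # placeholder input path
for res in output:
    res.save_to_json(save_path="output")      # assumed result-saving helper
    res.save_to_markdown(save_path="output")  # assumed result-saving helper
```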