Add pipeline_tag, library_name, and direct paper link

This PR enhances the model card for `MedVLSynther-3B-RL_10K` by adding key metadata and improving content discoverability:

* **`pipeline_tag: image-text-to-text`**: This categorizes the model for multimodal visual question answering, ensuring it appears under the correct pipeline filter on the Hugging Face Hub.
* **`library_name: transformers`**: Evidence from the `Usage` section and `config.json` confirms compatibility with the 🤗 Transformers library, enabling the automated "how to use" widget on the model page.
* **Direct Paper Link**: A direct link to the paper on the Hugging Face Hub (`https://huggingface.co/papers/2510.25867`) has been added at the top of the model card for easier access to the research.

These updates improve the model's discoverability and user experience on the Hub.

Files changed (1) hide show

README.md +12 -6

README.md CHANGED Viewed

@@ -1,15 +1,19 @@
 ---
-license: apache-2.0
 datasets:
 - MedVLSynther/MedSynVQA-10K
 language:
 - en
-base_model:
-- Qwen/Qwen2.5-VL-3B-Instruct
 ---
 # MedVLSynther-3B-RL_10K
 Code: https://github.com/UCSC-VLAA/MedVLSynther
 Project Page: https://ucsc-vlaa.github.io/MedVLSynther/
@@ -47,7 +51,8 @@ processor = AutoProcessor.from_pretrained(model_name)
 messages_1 = [
     {
         "role": "system",
-        "content": "You will solve a problem/request. You should provide your thoughts within <think> </think> tags before providing the answer.\nWrite your final answer within <answer> </answer> tags.",
     },
     {
         "role": "user",
@@ -64,7 +69,8 @@ messages_1 = [
 messages_2 = [
     {
         "role": "system",
-        "content": "You will solve a problem/request. You should provide your thoughts within <think> </think> tags before providing the answer.\nWrite your final answer within <answer> </answer> tags.",
     },
     {
         "role": "user",
@@ -72,7 +78,7 @@ messages_2 = [
             {
                 "type": "image",
                 "image": "assets/7bslake.png",
-            },
             {"type": "text", "text": "Does the picture contain kidney? Choices: (A) Yes (B) No"},
         ],
     }

 ---
+base_model:
+- Qwen/Qwen2.5-VL-3B-Instruct
 datasets:
 - MedVLSynther/MedSynVQA-10K
 language:
 - en
+license: apache-2.0
+pipeline_tag: image-text-to-text
+library_name: transformers
 ---
 # MedVLSynther-3B-RL_10K
+This model is presented in the paper [MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs](https://huggingface.co/papers/2510.25867).
 Code: https://github.com/UCSC-VLAA/MedVLSynther
 Project Page: https://ucsc-vlaa.github.io/MedVLSynther/
 messages_1 = [
     {
         "role": "system",
+        "content": "You will solve a problem/request. You should provide your thoughts within <think> </think> tags before providing the answer.
+Write your final answer within <answer> </answer> tags.",
     },
     {
         "role": "user",
 messages_2 = [
     {
         "role": "system",
+        "content": "You will solve a problem/request. You should provide your thoughts within <think> </think> tags before providing the answer.
+Write your final answer within <answer> </answer> tags.",
     },
     {
         "role": "user",
             {
                 "type": "image",
                 "image": "assets/7bslake.png",
+            },\
             {"type": "text", "text": "Does the picture contain kidney? Choices: (A) Yes (B) No"},
         ],
     }