Fix pipeline tag and link to paper
#1
by
nielsr
HF Staff
- opened
README.md
CHANGED
|
@@ -1,28 +1,28 @@
|
|
| 1 |
---
|
| 2 |
-
|
|
|
|
| 3 |
language:
|
| 4 |
- en
|
| 5 |
- zh
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 6 |
tags:
|
| 7 |
- llama-factory
|
| 8 |
- easy-r1
|
| 9 |
- full
|
| 10 |
- generated_from_trainer
|
| 11 |
-
metrics:
|
| 12 |
-
- f1
|
| 13 |
-
base_model:
|
| 14 |
-
- Qwen/Qwen2.5-VL-7B-Instruct
|
| 15 |
model-index:
|
| 16 |
-
- name: GuardReasoner-VL-7B
|
| 17 |
results: []
|
| 18 |
-
pipeline_tag: text-classification
|
| 19 |
-
library_name: transformers
|
| 20 |
---
|
| 21 |
|
| 22 |
-
# GuardReasoner-VL-7B
|
| 23 |
|
| 24 |
-
This model is a fine-tuned version of [Qwen/Qwen2.5-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct) via R-SFT and online RL.
|
| 25 |
-
This model is based on the paper [GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning](https://huggingface.co/papers/2505.11049).
|
| 26 |
|
| 27 |
<!-- The training data of R-SFT can be found in [GuardReasonerTrain](https://huggingface.co/datasets/yueliu1999/GuardReasonerTrain). -->
|
| 28 |
|
|
@@ -37,7 +37,7 @@ from transformers import AutoProcessor
|
|
| 37 |
from qwen_vl_utils import process_vision_info
|
| 38 |
|
| 39 |
parser = argparse.ArgumentParser(description="GuardReasoner-VL Inference")
|
| 40 |
-
parser.add_argument("--model_path", type=str, default="yueliu1999/GuardReasoner-VL-7B", help="model path")
|
| 41 |
parser.add_argument("--benchmark_path", type=str, default="./data/benchmark/", help="benchmark path")
|
| 42 |
args = parser.parse_args()
|
| 43 |
|
|
@@ -152,17 +152,23 @@ messages = [
|
|
| 152 |
|
| 153 |
case3_res = generate(messages)
|
| 154 |
|
| 155 |
-
print("case1:")
|
|
|
|
|
|
|
| 156 |
print("-"*30)
|
| 157 |
print(case1_res)
|
| 158 |
print("-"*30)
|
| 159 |
|
| 160 |
-
print("case2:")
|
|
|
|
|
|
|
| 161 |
print("-"*30)
|
| 162 |
print(case2_res)
|
| 163 |
print("-"*30)
|
| 164 |
|
| 165 |
-
print("case3:")
|
|
|
|
|
|
|
| 166 |
print("-"*30)
|
| 167 |
print(case3_res)
|
| 168 |
print("-"*30)
|
|
|
|
| 1 |
---
|
| 2 |
+
base_model:
|
| 3 |
+
- Qwen/Qwen2.5-VL-3B-Instruct
|
| 4 |
language:
|
| 5 |
- en
|
| 6 |
- zh
|
| 7 |
+
library_name: transformers
|
| 8 |
+
license: apache-2.0
|
| 9 |
+
metrics:
|
| 10 |
+
- f1
|
| 11 |
+
pipeline_tag: image-text-to-text
|
| 12 |
tags:
|
| 13 |
- llama-factory
|
| 14 |
- easy-r1
|
| 15 |
- full
|
| 16 |
- generated_from_trainer
|
|
|
|
|
|
|
|
|
|
|
|
|
| 17 |
model-index:
|
| 18 |
+
- name: GuardReasoner-VL-3B
|
| 19 |
results: []
|
|
|
|
|
|
|
| 20 |
---
|
| 21 |
|
| 22 |
+
# GuardReasoner-VL-3B
|
| 23 |
|
| 24 |
+
This model is a fine-tuned version of [Qwen/Qwen2.5-VL-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct) via R-SFT and online RL.
|
| 25 |
+
This model is based on the paper [GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning](https://huggingface.co/papers/2505.11049).
|
| 26 |
|
| 27 |
<!-- The training data of R-SFT can be found in [GuardReasonerTrain](https://huggingface.co/datasets/yueliu1999/GuardReasonerTrain). -->
|
| 28 |
|
|
|
|
| 37 |
from qwen_vl_utils import process_vision_info
|
| 38 |
|
| 39 |
parser = argparse.ArgumentParser(description="GuardReasoner-VL Inference")
|
| 40 |
+
parser.add_argument("--model_path", type=str, default="yueliu1999/GuardReasoner-VL-3B", help="model path")
|
| 41 |
parser.add_argument("--benchmark_path", type=str, default="./data/benchmark/", help="benchmark path")
|
| 42 |
args = parser.parse_args()
|
| 43 |
|
|
|
|
| 152 |
|
| 153 |
case3_res = generate(messages)
|
| 154 |
|
| 155 |
+
print("case1:")
|
| 158 |
print("-"*30)
|
| 159 |
print(case1_res)
|
| 160 |
print("-"*30)
|
| 161 |
|
| 162 |
+
print("case2:")
|
| 165 |
print("-"*30)
|
| 166 |
print(case2_res)
|
| 167 |
print("-"*30)
|
| 168 |
|
| 169 |
+
print("case3:")
|
| 172 |
print("-"*30)
|
| 173 |
print(case3_res)
|
| 174 |
print("-"*30)
|