TencentBAC/TBAC-VLR1-3B-preview

Image-Text-to-Text


DruryXu committed on Apr 17

Commit b1dcb39 · verified · 1 Parent(s): ada3a92

update README.md

Files changed (1)

README.md +3 -3


@@ -9,8 +9,8 @@ pipeline_tag: image-text-to-text
 # TBAC-VLR1-3B-preview
 ## Overview
-This is a multimodal language model fine-tuned by Tencent PCG Basic Algorithm Center. Based on Qwen2.5-VL-3B-Instruct, TBAC-VLR1-3B-preview uses Group Relative Policy Optimization
-(GRPO) to enhance multimodal reasoning ability, achieving state-of-the-art results on several multimodal reasoning benchmarks among models of the same size.
+This is a multimodal language model fine-tuned by **Tencent PCG Basic Algorithm Center**. Based on Qwen2.5-VL-3B-Instruct, TBAC-VLR1-3B-preview uses Group Relative Policy Optimization
+(GRPO) to enhance multimodal reasoning ability, achieving **state-of-the-art** results on several multimodal reasoning benchmarks among models of the same size.
 ## Performance
 | Model                     | **Average** | **MathVista**| **MathVision** | **MathVerse** | **DynaMath**  | **WeMath**| **LogicVista** |
@@ -97,4 +97,4 @@ If you find our model useful in your research, please consider giving ❤️ and
 **About**
-Created by the Tencent BAC Group. All rights reserved.
+Created by the Tencent PCG Basic Algorithm Center. All rights reserved.