DruryXu committed · Commit b1dcb39 · verified · 1 Parent(s): ada3a92

update README.md

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -9,8 +9,8 @@ pipeline_tag: image-text-to-text
  # TBAC-VLR1-3B-preview
 
  ## Overview
- This is a multimodal language model fine-tuned by Tencent PCG Basic Algorithm Center. Based on Qwen2.5-VL-3B-Instruct, TBAC-VLR1-3B-preview uses Group Relative Policy Optimization
- (GRPO) to enhance multimodal reasoning ability, achieving state-of-the-art results on several multimodal reasoning benchmarks among models of the same size.
+ This is a multimodal language model fine-tuned by **Tencent PCG Basic Algorithm Center**. Based on Qwen2.5-VL-3B-Instruct, TBAC-VLR1-3B-preview uses Group Relative Policy Optimization
+ (GRPO) to enhance multimodal reasoning ability, achieving **state-of-the-art** results on several multimodal reasoning benchmarks among models of the same size.
 
  ## Performance
  | Model | **Average** | **MathVista**| **MathVision** | **MathVerse** | **DynaMath** | **WeMath**| **LogicVista** |
@@ -97,4 +97,4 @@ If you find our model useful in your research, please consider giving ❤️ and
 
  **About**
 
- Created by the Tencent BAC Group. All rights reserved.
+ Created by the Tencent PCG Basic Algorithm Center. All rights reserved.