3h_qwen2_vl-7b_GT_as_chosen_exp_1-2 / training_rewards_accuracies.png

Commit History

Add model files
0472f11

haohaihong commited on