Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
CodeGoat24
/
UnifiedReward-Think-qwen-7b
like
3
Safetensors
9 datasets
qwen2_5_vl
arxiv:
2505.03318
License:
mit
Model card
Files
Files and versions
xet
Community
1
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (0)
Sort: Recently created
Update model card for CodeGoat24/UnifiedReward-Think-qwen-7b (Pref-GRPO reward model)
#1 opened 3 months ago by
nielsr