Active filters: prm, trl
qgallouedec/Qwen2-0.5B-Reward • Token Classification • 0.5B • Updated • 13
plaguss/Qwen2.5-Math-7B-PRM-0.1 • Token Classification • 7B • Updated • 14
plaguss/Qwen2.5-Math-7B-Instruct-PRM-0.1 • Token Classification • 7B • Updated • 8
plaguss/Qwen2.5-Math-1.5B-Instruct-PRM-0.1 • Token Classification • 2B • Updated • 10
HuggingFaceH4/Qwen2.5-Math-1.5B-Instruct-PRM-0.2 • Token Classification • 2B • Updated • 37
HuggingFaceH4/Qwen2.5-Math-7B-Instruct-PRM-0.2 • Token Classification • 7B • Updated • 38
Token Classification • 66.4M • Updated • 9
MikeMpapa/TraseSystem-orm-codeblob-verifier • Token Classification • 0.5B • Updated • 3
smohammadi/Qwen2.5-3B-MathShepherd • Token Classification • 3B • Updated • 3
axolotl-ai-co/Qwen2.5-Math-PRM-7B • Token Classification • 7B • Updated • 10 • 1
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-V3 • Token Classification • 0.5B • Updated • 11
alothomas/Qwen2.5-3B-PRM-RAD-balanced-V3 • Token Classification • 3B • Updated • 6
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-V4 • Token Classification • 0.5B • Updated • 16
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-150k • Token Classification • 0.5B • Updated • 111
alothomas/Qwen2.5-3B-PRM-RAD-balanced-150k • Token Classification • 3B • Updated • 10
hzy/Qwen2.5-Math-7B-Instruct-PRM-Modified-math_shepherd • Token Classification • 7B • Updated • 13
jacopo-minniti/uats-value-model • Token Classification • 2B • Updated • 2
jacopo-minniti/Qwen2.5-Math-7B-PUM • Token Classification • 7B • Updated • 4
jacopo-minniti/Qwen2.5-Math-7B-PUM-half_entropy • Token Classification • 7B • Updated • 3
jacopo-minniti/Qwen2.5-Math-7B-PUM-soft-classification • 2B • Updated • 7
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-150k-LastStepOnly • Token Classification • 0.5B • Updated • 4
jacopo-minniti/Qwen2.5-Math-1.5B-PUM-variance • 2B • Updated • 13
jacopo-minniti/Qwen2.5-Math-1.5B-PUM-binary-variance • Token Classification • 2B • Updated • 7
yungshun317/qwen2.5-0.5B-prm-mathshepherd • Token Classification • 0.5B • Updated • 4
jacopo-minniti/R1-Qwen-MMLU-1.5B-PUM-Variance • 2B • Updated • 186
jacopo-minniti/R1-Qwen-MMLU-1.5B-PRM • 2B • Updated • 49
jacopo-minniti/R1-Qwen-MMLU-1.5B-PRM-Regression • 2B • Updated • 120
ZaandaTeika/Qwen2.5-Math-7B-Instruct-SHARP-Math-PRM • Token Classification • 7B • Updated • 5
ZaandaTeika/Qwen2.5-Math-1.5B-Instruct-SHARP-Math-PRM • Token Classification • 2B • Updated • 10