Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
Kangda Wei
kangdawei
Follow
0 followers
·
2 following
AI & ML interests
None yet
Recent Activity
updated
a model
about 23 hours ago
kangdawei/MMR-GRPO-8B
updated
a model
about 23 hours ago
kangdawei/Open-RS-8B
updated
a model
about 23 hours ago
kangdawei/Open-RS-DR_GRPO-8B
View all activity
Organizations
None yet
kangdawei
's models
40
Sort: Recently updated
kangdawei/MMR-GRPO-8B
Text Generation
•
8B
•
Updated
about 23 hours ago
kangdawei/Open-RS-8B
Text Generation
•
8B
•
Updated
about 23 hours ago
kangdawei/Open-RS-DR_GRPO-8B
Text Generation
•
8B
•
Updated
about 23 hours ago
kangdawei/MMR-DR_GRPO-8B
Text Generation
•
8B
•
Updated
about 23 hours ago
kangdawei/DRA-DR_GRPO-8B
Text Generation
•
8B
•
Updated
1 day ago
kangdawei/DRA-GRPO-8B
Text Generation
•
8B
•
Updated
1 day ago
kangdawei/Open-RS-DR_GRPO-7B
Text Generation
•
8B
•
Updated
3 days ago
•
49
•
1
kangdawei/Open-RS-7B
Text Generation
•
8B
•
Updated
3 days ago
•
54
kangdawei/MMR-GRPO-7B
Text Generation
•
8B
•
Updated
3 days ago
•
46
kangdawei/MMR-DR_GRPO-7B
Text Generation
•
8B
•
Updated
3 days ago
•
52
kangdawei/DRA-GRPO-7B
Text Generation
•
8B
•
Updated
3 days ago
•
39
kangdawei/DRA-DR_GRPO-7B
Text Generation
•
8B
•
Updated
4 days ago
•
5
kangdawei/MMR-DR_GRPO-OpenS1
Text Generation
•
2B
•
Updated
20 days ago
•
106
kangdawei/DRA-DR_GRPO-OpenS1
Text Generation
•
2B
•
Updated
21 days ago
•
81
kangdawei/Open-RS-DR_GRPO-OpenS1
Text Generation
•
2B
•
Updated
21 days ago
•
99
kangdawei/Open-RS-OpenS1
Text Generation
•
2B
•
Updated
21 days ago
•
73
kangdawei/DRA-GRPO-OpenS1
Text Generation
•
2B
•
Updated
22 days ago
•
42
kangdawei/MMR-GRPO-OpenS1
Text Generation
•
2B
•
Updated
23 days ago
•
64
kangdawei/MMR-DR_GRPO-lambda-0.9
Text Generation
•
2B
•
Updated
30 days ago
•
34
kangdawei/MMR-DR_GRPO-lambda-0.8
Text Generation
•
2B
•
Updated
about 1 month ago
•
43
kangdawei/Open-RS-DRGRPO
Text Generation
•
2B
•
Updated
about 1 month ago
•
62
kangdawei/DRA-DR_GRPO
Text Generation
•
2B
•
Updated
about 1 month ago
•
38
kangdawei/MMR-DR_GRPO-lambda-0.7
Text Generation
•
2B
•
Updated
about 1 month ago
•
30
kangdawei/MMR-DR_GRPO-lambda-0.6
Text Generation
•
2B
•
Updated
about 1 month ago
•
21
kangdawei/MMR-DR_GRPO-lambda-0.5
Text Generation
•
2B
•
Updated
about 1 month ago
•
22
kangdawei/MMR-Adaptive-Smooth-GRPO
Text Generation
•
2B
•
Updated
Oct 25
•
4
kangdawei/MMR-Adaptive-Smooth-DR_GRPO
2B
•
Updated
Oct 25
•
4
kangdawei/DRA-GRPO
Text Generation
•
2B
•
Updated
Oct 25
•
15
kangdawei/MMR-GRPO-lambda-0.9
Text Generation
•
2B
•
Updated
Oct 25
•
19
kangdawei/MMR-GRPO-lambda-0.7
Text Generation
•
2B
•
Updated
Oct 25
•
21
Previous
1
2
Next