Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Yan
KaiZhuo
Follow
0 followers
·
1 following
AI & ML interests
None yet
Organizations
models
3
Sort: Recently updated
KaiZhuo/Qwen_3B_RM
3B
•
Updated
Sep 4
KaiZhuo/Qwen2.5-7B-Instruct-RM-RL
Updated
Jun 29
KaiZhuo/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Sep 12, 2024
•
8
datasets
0
None public yet