Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
joo0405
's Collections
rl
Llm
Llm
updated
May 10
Upvote
-
Phi-4-reasoning Technical Report
Paper
•
2504.21318
•
Published
Apr 30
•
53
Flow-GRPO: Training Flow Matching Models via Online RL
Paper
•
2505.05470
•
Published
May 8
•
86
Upvote
-
Share collection
View history
Collection guide
Browse collections