Jiafei Lyu

dmux
https://dmksjfl.github.io/
  • dmksjfl

AI & ML interests

Reinforcement Learning

Recent Activity

upvoted a paper about 16 hours ago
EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control
authored a paper 8 months ago
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning
authored a paper over 1 year ago
SEABO: A Simple Search-Based Method for Offline Imitation Learning

Organizations

Tsinghua University

dmux's models

None public yet