Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
random's picture
16 23 54

random

fakerbaby
21world's profile picture kirch's profile picture kramp's profile picture
·
  • fakerbaby

AI & ML interests

NLP, RL, VLM

Recent Activity

upvoted a paper 4 days ago
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs
upvoted a paper 16 days ago
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping
liked a model 28 days ago
zai-org/GLM-4.6
View all activity

Organizations

Skywork's profile picture

Collections 1

Alignment
  • Secrets of RLHF in Large Language Models Part I: PPO

    Paper • 2307.04964 • Published Jul 11, 2023 • 29
Alignment
  • Secrets of RLHF in Large Language Models Part I: PPO

    Paper • 2307.04964 • Published Jul 11, 2023 • 29

Papers 9

arXiv:2403.07708
arXiv:2402.01391
arXiv:2401.11458
arXiv:2401.06080

spaces 2

Sleeping

Skywork R1V3

💬

Jul 25
No application file

PaI

🌖

Dec 11, 2022

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs