Wei Fu

garrett4wade

AI & ML interests

RL

Recent Activity

liked a dataset about 1 month ago
inclusionAI/ASearcher-train-data
upvoted a paper 3 months ago
Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL
upvoted a paper 5 months ago
VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments

Organizations

None yet

authored 2 papers over 1 year ago

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

Paper • 2306.16688 • Published Jun 29, 2023

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study

Paper • 2404.10719 • Published Apr 16, 2024 • 6