Wei Fu

garrett4wade

AI & ML interests

RL

Recent Activity

liked a dataset about 1 month ago

inclusionAI/ASearcher-train-data

upvoted a paper 3 months ago

Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL

upvoted a paper 5 months ago

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments

Organizations

None yet

authored 2 papers over 1 year ago

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

Paper • 2306.16688 • Published Jun 29, 2023

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study

Paper • 2404.10719 • Published Apr 16, 2024 • 6