Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
langfeng01
's Collections
TimeMaster
verl-agent
verl-agent
updated
Jun 20
Open-source models trained via GiGPO and verl-agent
Upvote
2
langfeng01/GiGPO-Qwen2.5-7B-Instruct-WebShop
8B
•
Updated
Sep 28
•
860
langfeng01/GiGPO-Qwen2.5-7B-Instruct-ALFWorld
8B
•
Updated
Sep 28
•
82
•
1
Group-in-Group Policy Optimization for LLM Agent Training
Paper
•
2505.10978
•
Published
May 16
•
18
Upvote
2
Share collection
View history
Collection guide
Browse collections