Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
dongguanting
's Collections
AEPO
ARPO
Tool-Star
RAG-Critic
AEPO
updated
Oct 21
The official datasets and model checkpoints of AEPO
Upvote
4
Agentic Entropy-Balanced Policy Optimization
Paper
•
2510.14545
•
Published
Oct 16
•
103
dongguanting/Qwen3-8B-AEPO-DeepSearch
Text Generation
•
8B
•
Updated
Oct 27
•
7
•
1
dongguanting/Qwen3-14B-AEPO-DeepSearch
Robotics
•
15B
•
Updated
Oct 21
•
9
•
1
dongguanting/Qwen2.5-7B-AEPO
Text Generation
•
8B
•
Updated
Oct 27
•
16
•
1
Upvote
4
Share collection
View history
Collection guide
Browse collections