Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
dongguanting 's Collections
AEPO
ARPO
Tool-Star
RAG-Critic

AEPO

updated Oct 21

The official datasets and model checkpoints of AEPO

Upvote
4

  • Agentic Entropy-Balanced Policy Optimization

    Paper • 2510.14545 • Published Oct 16 • 103

  • dongguanting/Qwen3-8B-AEPO-DeepSearch

    Text Generation • 8B • Updated Oct 27 • 7 • 1

  • dongguanting/Qwen3-14B-AEPO-DeepSearch

    Robotics • 15B • Updated Oct 21 • 9 • 1

  • dongguanting/Qwen2.5-7B-AEPO

    Text Generation • 8B • Updated Oct 27 • 16 • 1
Upvote
4
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs