Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Kwai-Klear 's Collections
mini-swe-agent-plus
Klear-AgentForge
Klear1.0
KlearReasoner-8B
RLEP

Klear-AgentForge

updated 9 days ago

Effective supervised fine-tuning (SFT) with synthetic data followed by multi-turn reinforcement learning (RL) for boosting agentic models.

Upvote
3

  • Kwai-Klear/Klear-AgentForge-8B-SFT

    308k • Updated about 1 month ago • 9 • 3

  • Kwai-Klear/SWE-smith-mini_swe_agent_plus-trajectories-66k

    Viewer • Updated 16 days ago • 66k • 841 • 8

  • Kwai-Klear/Klear-AgentForge-8B

    8B • Updated 10 days ago • 12 • 1
Upvote
3
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs