Klear-AgentForge - a Kwai-Klear Collection

Klear-AgentForge

updated 9 days ago

Effective supervised fine-tuning (SFT) with synthetic data followed by multi-turn reinforcement learning (RL) for boosting agentic models.

Kwai-Klear/Klear-AgentForge-8B-SFT

308k • Updated about 1 month ago • 9 • 3

Kwai-Klear/SWE-smith-mini_swe_agent_plus-trajectories-66k

Viewer • Updated 16 days ago • 66k • 841 • 8

Kwai-Klear/Klear-AgentForge-8B

8B • Updated 10 days ago • 12 • 1