Kwai-Klear/Klear-AgentForge-8B-SFT
308k
•
Updated
•
9
•
3
Effective supervised fine-tuning (SFT) with synthetic data followed by multi-turn reinforcement learning (RL) for boosting agentic models.