Running on CPU Upgrade 2.24k 2.24k The Smol Training Playbook 📚 The secrets to building world-class LLMs
Reward Models 10-2025 Collection A collection of great reward models for research and production • 7 items • Updated 1 day ago • 7
Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design Paper • 2506.04734 • Published Jun 5 • 20