The Smol Training Playbook 📚 • 2.31k • The secrets to building world-class LLMs
The Ultra-Scale Playbook 🌌 • 3.49k • The ultimate guide to training LLMs on large GPU clusters
VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning Paper • 2510.10518 • Published Oct 12 • 17
TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar Paper • 2510.14972 • Published Oct 16 • 30