Running 3.52k The Ultra-Scale Playbook ๐ 3.52k The ultimate guide to training LLM on large GPU Clusters
Running on CPU Upgrade Featured 2.46k The Smol Training Playbook ๐ 2.46k The secrets to building world-class LLMs
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper โข 2506.13585 โข Published Jun 16 โข 272
MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs Paper โข 2505.21327 โข Published May 27 โข 83