The Smol Training Playbook: The Secrets to Building World-Class LLMs • 1.47k
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published 6 days ago • 93
Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques Article • By Isayoften • Aug 26, 2024 • 78
Demystifying Reinforcement Learning in Agentic Reasoning Paper • 2510.11701 • Published 23 days ago • 31
A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10 • 186
TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning Paper • 2510.06217 • Published 29 days ago • 62