Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR • Paper • 2508.14029 • Published Aug 19 • 118
From System 1 to System 2: A Survey of Reasoning Large Language Models • Paper • 2502.17419 • Published Feb 24 • 3
Spurious Forgetting in Continual Learning of Language Models • Paper • 2501.13453 • Published Jan 23 • 1
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction • Paper • 2309.14316 • Published Sep 25, 2023 • 8