Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention Paper • 2510.04212 • Published Oct 5 • 22
Large Reasoning Models Learn Better Alignment from Flawed Thinking Paper • 2510.00938 • Published Oct 1 • 57
Beyond the Surface: Probing the Ideological Depth of Large Language Models Paper • 2508.21448 • Published Aug 29
Testing Conviction: An Argumentative Framework for Measuring LLM Political Stability Paper • 2504.17052 • Published Apr 23
Visual Representation Alignment for Multimodal Large Language Models Paper • 2509.07979 • Published Sep 9 • 83
Scaling up Multi-Turn Off-Policy RL and Multi-Agent Tree Search for LLM Step-Provers Paper • 2509.06493 • Published Sep 8 • 11