Beyond Binary Rewards: Training LMs to Reason About Their Uncertainty Paper • 2507.16806 • Published Jul 22 • 6 • 1