From Data to Rewards: a Bilevel Optimization Perspective on Maximum Likelihood Estimation Paper • 2510.07624 • Published Oct 8, 2025 • 7