P1: Mastering Physics Olympiads with Reinforcement Learning Paper • 2511.13612 • Published Nov 17, 2025 • 134
BroRL: Scaling Reinforcement Learning via Broadened Exploration Paper • 2510.01180 • Published Oct 1, 2025 • 18
Symbolic Graphics Programming with Large Language Models Paper • 2509.05208 • Published Sep 5, 2025 • 46
Sphere Prover Collection The dataset and ckpt in Sphere-Prover-V1: Training LLM-based Prover for Formal Mathematics via Exploration-based Reinforocement Learning • 10 items • Updated Aug 21, 2025