view article Article OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve By codelion • May 20 • 49
⚔️ BigCodeArena Collection Unveiling More Reliable Human Preferences in Code Generation via Execution • 8 items • Updated 26 days ago • 5
view article Article BigCodeArena: Judging code generations end to end with code executions By bigcode • Oct 7 • 17
miniCTX Collection miniCTX: Neural Theorem Proving with (Long-)Contexts (ICLR 2025 Oral) • 8 items • Updated Mar 19 • 2
L1 Collection L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning • 7 items • Updated Jul 13 • 8