CORE: Benchmarking LLMs Code Reasoning Capabilities through Static Analysis Tasks Paper • 2507.05269 • Published Jul 3
TENET: Leveraging Tests Beyond Validation for Code Generation Paper • 2509.24148 • Published Sep 29