KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation Paper • 2505.14552 • Published May 20 • 1
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published 18 days ago • 253
V-GameGym: Visual Game Generation for Code Large Language Models Paper • 2509.20136 • Published Sep 24 • 9
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published Feb 20 • 104