Voice Evaluation of Reasoning Ability: Diagnosing the Modality-Induced Performance Gap • Paper • 2509.26542 • Published Sep 30
Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play • Paper • 2509.25541 • Published Sep 29