Voice Evaluation of Reasoning Ability: Diagnosing the Modality-Induced Performance Gap Paper • 2509.26542 • Published Sep 30 • 8
Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play Paper • 2509.25541 • Published Sep 29 • 139
Angles Don't Lie: Unlocking Training-Efficient RL Through the Model's Own Signals Paper • 2506.02281 • Published Jun 2 • 4