See, Think, Act: Teaching Multimodal Agents to Effectively Interact with GUI by Identifying Toggles Paper • 2509.13615 • Published Sep 17