Submitted by Yuheng Shi 2 Catching the Details: Self-Distilled RoI Predictors for Fine-Grained MLLM Perception The University of Sydney 7 2