Use HF's "balanced" device_map and dynamically pair diffusion components with the relevant execution cores

Commit 48fd1f5 (verified), committed by diopside on Sep 19