ICLR2025 OmniSep: https://omnisep.github.io/
Separate audio sources using text, images, and audio queries