UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback
To bridge the gap between generative flexibility and physical correctness, we propose UniLumos, a unified relighting framework for both images and videos that brings RGB-space geometry feedback into a flow-matching backbone.
π§βπ» Authors
Ropeway Liu1,2,*, Hangjie Yuan2,3,1,*β , Bo Dong2,3, Jiazheng Xing1,2,4, Jinwang Wang2,3,1, Rui Zhao4, Yan Xing2,3, Weihua Chen2,3β , Fan Wang2
1Zhejiang University, 2DAMO Academy, Alibaba Group, 3Hupan Lab, 4National University of Singapore
*Equal contributionsβββ Corresponding Author
π Click to view Abstract
Relighting is a crucial task with both practical demand and artistic value, and recent diffusion models have shown strong potential by enabling rich and controllable lighting effects. However, as they are typically optimized in semantic latent space, where proximity does not guarantee physical correctness in visual space, they often produce unrealistic resultsβsuch as overexposed highlights, misaligned shadows, and incorrect occlusions.
We address this with UniLumos, a unified relighting framework for both images and videos that brings RGB-space geometry feedback into a flow-matching backbone. By supervising the model with depth and normal maps extracted from its outputs, we explicitly align lighting effects with the scene structure, enhancing physical plausibility. Nevertheless, this feedback requires high-quality outputs for supervision in visual space, making standard multi-step denoising computationally expensive. To mitigate this, we employ path consistency learning, allowing supervision to remain effective even under few-step training regimes.
To enable fine-grained relighting control and supervision, we design a structured six-dimensional annotation protocol capturing core illumination attributes. Building upon this, we propose LumosBench, a disentangled attribute-level benchmark that evaluates lighting controllability via large vision-language models, enabling automatic and interpretable assessment of relighting precision across individual dimensions.
Extensive experiments demonstrate that UniLumos achieves state-of-the-art relighting quality with significantly improved physical consistency, while delivering a 20x speedup for both image and video relighting.
π News
[2025/9/19] Accepted by NeurIPS 2025 !
[2025/10/29] Code is available now!