--- language: - en license: openrail++ pipeline_tag: image-to-image library_name: diffusers tags: - intrinsic decomposition - image analysis - computer vision - in-the-wild - zero-shot pinned: true ---

Marigold Intrinsic Image Decomposition (IID) Appearance v1-1 Model Card

Image IID diffusers Github Website arXiv Social License

This is a model card for the `marigold-iid-appearance-v1-1` model for single-image Intrinsic Image Decomposition (IID). The model is fine-tuned from the `stable-diffusion-2` [model](https://huggingface.co/stabilityai/stable-diffusion-2) as described in our papers: - [CVPR'2024 paper](https://hf.co/papers/2312.02145) titled "Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation" - [Journal extension](https://hf.co/papers/2505.09358) titled "Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis" ### Using the model This model type (`appearance`) is trained to perform InteriorVerse decomposition into **Albedo** and two **BRDF material** properties: **roughness** and **metallicity**. Both the input image and the output albedo are in the sRGB color space. For an alternative model type (`lighting`) that performs decomposition into Albedo, Diffuse shading, and Non-diffuse residual, click [here](https://huggingface.co/prs-eth/marigold-iid-lighting-v1-1). - Play with the interactive [Hugging Face Spaces demo](https://huggingface.co/spaces/prs-eth/marigold-iid): check out how the model works with example images or upload your own. - Use it with [diffusers](https://huggingface.co/docs/diffusers/using-diffusers/marigold_usage) to compute the results with a few lines of code. - Get to the bottom of things with our [official codebase](https://github.com/prs-eth/marigold). ## Model Details - **Developed by:** [Bingxin Ke](http://www.kebingxin.com/), [Kevin Qu](https://ch.linkedin.com/in/kevin-qu-b3417621b), [Tianfu Wang](https://tianfwang.github.io/), [Nando Metzger](https://nandometzger.github.io/), [Shengyu Huang](https://shengyuh.github.io/), [Bo Li](https://www.linkedin.com/in/bobboli0202), [Anton Obukhov](https://www.obukhov.ai/), [Konrad Schindler](https://scholar.google.com/citations?user=FZuNgqIAAAAJ). - **Model type:** Generative latent diffusion-based intrinsic image decomposition (appearance: albedo, roughness, and metallicity) from a single image. - **Language:** English. - **License:** [CreativeML Open RAIL++-M License](https://huggingface.co/stabilityai/stable-diffusion-2/blob/main/LICENSE-MODEL). - **Model Description:** This model can be used to generate an estimated intrinsic image decomposition of an input image. - **Resolution**: Even though any resolution can be processed, the model inherits the base diffusion model's effective resolution of roughly **768** pixels. This means that for optimal predictions, any larger input image should be resized to make the longer side 768 pixels before feeding it into the model. - **Steps and scheduler**: This model was designed for usage with **DDIM** scheduler and between **1 and 50** denoising steps. - **Outputs**: - **Albedo**: The predicted values are between 0 and 1, sRGB space. - **Roughness and metallicity**: The predicted values are between 0 and 1, linear space. - **Uncertainty maps**: Produced for each modality only when multiple predictions are ensembled with ensemble size larger than 2. - **Resources for more information:** [Project Website](https://marigoldcomputervision.github.io/), [Paper](https://arxiv.org/abs/2505.09358), [Code](https://github.com/prs-eth/marigold). - **Cite as:** ```bibtex @misc{ke2025marigold, title={Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis}, author={Bingxin Ke and Kevin Qu and Tianfu Wang and Nando Metzger and Shengyu Huang and Bo Li and Anton Obukhov and Konrad Schindler}, year={2025}, eprint={2505.09358}, archivePrefix={arXiv}, primaryClass={cs.CV} } @InProceedings{ke2023repurposing, title={Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation}, author={Bingxin Ke and Anton Obukhov and Shengyu Huang and Nando Metzger and Rodrigo Caye Daudt and Konrad Schindler}, booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, year={2024} } ```