This is a model card for the marigold-depth-v1-0 model for monocular depth estimation from a single image. The model is fine-tuned from the stable-diffusion-2 model as described in our papers (cited below).

The "timestep_spacing": "trailing" setting can be enabled either in the scheduler configuration file or in code, by adding pipe.scheduler = DDIMScheduler.from_config(pipe.scheduler.config, timestep_spacing="trailing") after the pipeline is loaded and before its first use. For compatibility reasons, we kept this v1-0 model identical to the paper setting and provide a newer v1-1 model with optimal settings for all possible step configurations.

@InProceedings{ke2023repurposing,
  title={Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation},
  author={Bingxin Ke and Anton Obukhov and Shengyu Huang and Nando Metzger and Rodrigo Caye Daudt and Konrad Schindler},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2024}
}

@misc{ke2025marigold,
  title={Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis},
  author={Bingxin Ke and Kevin Qu and Tianfu Wang and Nando Metzger and Shengyu Huang and Bo Li and Anton Obukhov and Konrad Schindler},
  year={2025},
  eprint={2505.09358},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}
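
The scheduler override mentioned in this card can be sketched as follows. This is a minimal sketch under stated assumptions, not a definitive recipe: it assumes a diffusers release that ships MarigoldDepthPipeline, and it assumes the checkpoint is published under the hub id prs-eth/marigold-depth-v1-0 (the organization prefix is an assumption; this card only names marigold-depth-v1-0). The trailing_timesteps helper is hypothetical and only illustrates our reading of what "trailing" spacing means, assuming 1000 training timesteps in the usage shown.

```python
def trailing_timesteps(num_train_timesteps: int, num_inference_steps: int) -> list[int]:
    """Illustrative only: timesteps visited under "trailing" spacing.

    The schedule is anchored at the last training timestep and walks down in
    equal strides. This is a hypothetical helper, not code taken from diffusers.
    """
    stride = num_train_timesteps / num_inference_steps
    return [round(num_train_timesteps - i * stride) - 1 for i in range(num_inference_steps)]


def load_depth_pipeline(model_id: str = "prs-eth/marigold-depth-v1-0", device: str = "cuda"):
    """Load the depth pipeline and switch its scheduler to trailing spacing.

    The default model_id is an assumption (see the note above the code).
    """
    # Heavy imports stay local so the helper above runs without them installed.
    import torch
    from diffusers import DDIMScheduler, MarigoldDepthPipeline

    pipe = MarigoldDepthPipeline.from_pretrained(model_id, torch_dtype=torch.float16).to(device)
    # The override quoted in this card: applied after loading, before first use.
    pipe.scheduler = DDIMScheduler.from_config(
        pipe.scheduler.config, timestep_spacing="trailing"
    )
    return pipe


# With 1000 training timesteps and 4 inference steps, the trailing schedule
# starts at the final training timestep.
print(trailing_timesteps(1000, 4))  # → [999, 749, 499, 249]
```

Once loaded, the pipeline would typically be called on a PIL image, for example depth = load_depth_pipeline()(image), with the predicted depth map available as depth.prediction. Editing the scheduler configuration file instead achieves the same effect without code changes.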