|
--- |
|
license: cc-by-sa-4.0 |
|
language: |
|
- en |
|
pipeline_tag: depth-estimation |
|
tags: |
|
- monocular |
|
- depth estimation |
|
- single image depth estimation |
|
- single image |
|
- in-the-wild |
|
- zero-shot |
|
- depth |
|
--- |
|
# Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation |
|
|
|
This model represents the official checkpoint of the paper titled "Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation". |
|
|
|
[![Website](doc/badges/badge-website.svg)](https://marigoldmonodepth.github.io) |
|
[![GitHub](https://img.shields.io/github/stars/prs-eth/Marigold?style=default&label=GitHub%20β
&logo=github)](https://github.com/prs-eth/Marigold) |
|
[![Paper](doc/badges/badge-pdf.svg)](https://arxiv.org/abs/2312.02145) |
|
[![Hugging Face Space](https://img.shields.io/badge/π€%20Hugging%20Face-Space-yellow)](https://huggingface.co/spaces/toshas/marigold) |
|
[<img src="doc/badges/badge-license.svg" height="20"/>](http://creativecommons.org/licenses/by-nc-sa/4.0/) |
|
<!-- [![HF Space](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Space-blue)]() --> |
|
<!-- [![Open In Colab](doc/badges/badge-colab.svg)]() --> |
|
<!-- [![Docker](doc/badges/badge-docker.svg)]() --> |
|
<!-- ### [Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation]() --> |
|
|
|
[Bingxin Ke](http://www.kebingxin.com/), |
|
[Anton Obukhov](https://www.obukhov.ai/), |
|
[Shengyu Huang](https://shengyuh.github.io/), |
|
[Nando Metzger](https://nandometzger.github.io/), |
|
[Rodrigo Caye Daudt](https://rcdaudt.github.io/), |
|
[Konrad Schindler](https://scholar.google.com/citations?user=FZuNgqIAAAAJ&hl=en ) |
|
|
|
We present Marigold, a diffusion model and associated fine-tuning protocol for monocular depth estimation. Its core principle is to leverage the rich visual knowledge stored in modern generative image models. Our model, derived from Stable Diffusion and fine-tuned with synthetic data, can zero-shot transfer to unseen data, offering state-of-the-art monocular depth estimation results. |
|
|
|
![teaser](doc/teaser_collage_transparant.png) |
|
|
|
|
|
## π Citation |
|
|
|
```bibtex |
|
@misc{ke2023marigold, |
|
author = {Ke, Bingxin and Obukhov, Anton and Huang, Shengyu and Metzger, Nando and Daudt, Rodrigo Caye and Schindler, Konrad}, |
|
title = {Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation}, |
|
year = {2023}, |
|
} |
|
``` |
|
|
|
## License |
|
|
|
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. |
|
|
|
[<img src="doc/badges/badge-license.svg" height="20"/>](http://creativecommons.org/licenses/by-nc-sa/4.0/) |