adirik / marigold

Monocular depth estimation

Cold

Public
8K runs
L40S
GitHub
Paper
License

Iterate in playground

Run with an API

Playground API Examples README Versions

Input

image

*file

Input image, use an RGB image for optimal results.

resize_input

boolean

Resize the original input resolution to max resolution.

Default: true

num_infer

integer

(minimum: 1, maximum: 20)

Number of inferences to be ensembled, a higher number gives better results but runs slower.

Default: 10

denoise_steps

integer

(minimum: 1, maximum: 50)

Inference denoising steps, more steps results in higher accuracy but slower inference speed.

Default: 10

regularizer_strength

number

(minimum: 0, maximum: 1)

Ensembling parameter, weight of optimization regularizer.

Default: 0.02

reduction_method

string

Ensembling parameter, method to merge aligned depth maps.

Default: "median"

max_iter

integer

(minimum: 1, maximum: 20)

Ensembling parameter, max optimization iterations.

Default: 5

seed

integer

Seed for reproducibility, set to random if left as None.

Run this model in Node.js with one line of code:

npx create-replicate --model=adirik/marigold

or set up a project from scratch

Install Replicate’s Node.js client library:

npm install replicate

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Import and set up the client:

import Replicate from "replicate";

const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});

Run adirik/marigold using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

const output = await replicate.run(
  "adirik/marigold:1a363593bc4882684fc58042d19db5e13a810e44e02f8d4c32afd1eb30464818",
  {
    input: {
      image: "https://replicate.delivery/pbxt/K3HlYnhvVI5IX35KJ6RkTTzoHwEDKL7KtAiPc4F4fDvcgJX3/pete-walls-92JRuvQZfKs-unsplash_crop43.jpg",
      max_iter: 5,
      num_infer: 10,
      resize_input: true,
      denoise_steps: 10,
      reduction_method: "median",
      regularizer_strength: 0.02
    }
  }
);

// To access the file URL:
console.log(output[0].url()); //=> "http://example.com"

// To write the file to disk:
fs.writeFile("my-image.png", output[0]);

To learn more, take a look at the guide on getting started with Node.js.

Install Replicate’s Python client library:

pip install replicate

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Import the client:

import replicate

Run adirik/marigold using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

output = replicate.run(
    "adirik/marigold:1a363593bc4882684fc58042d19db5e13a810e44e02f8d4c32afd1eb30464818",
    input={
        "image": "https://replicate.delivery/pbxt/K3HlYnhvVI5IX35KJ6RkTTzoHwEDKL7KtAiPc4F4fDvcgJX3/pete-walls-92JRuvQZfKs-unsplash_crop43.jpg",
        "max_iter": 5,
        "num_infer": 10,
        "resize_input": True,
        "denoise_steps": 10,
        "reduction_method": "median",
        "regularizer_strength": 0.02
    }
)
print(output)

To learn more, take a look at the guide on getting started with Python.

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Run adirik/marigold using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "adirik/marigold:1a363593bc4882684fc58042d19db5e13a810e44e02f8d4c32afd1eb30464818",
    "input": {
      "image": "https://replicate.delivery/pbxt/K3HlYnhvVI5IX35KJ6RkTTzoHwEDKL7KtAiPc4F4fDvcgJX3/pete-walls-92JRuvQZfKs-unsplash_crop43.jpg",
      "max_iter": 5,
      "num_infer": 10,
      "resize_input": true,
      "denoise_steps": 10,
      "reduction_method": "median",
      "regularizer_strength": 0.02
    }
  }' \
  https://api.replicate.com/v1/predictions

To learn more, take a look at Replicate’s HTTP API reference docs.

Output

{
  "completed_at": "2023-12-15T08:24:45.085754Z",
  "created_at": "2023-12-15T08:24:30.867733Z",
  "data_removed": false,
  "error": null,
  "id": "rwasfmdbcx25iw2kneyh67w2fy",
  "input": {
    "image": "https://replicate.delivery/pbxt/K3HlYnhvVI5IX35KJ6RkTTzoHwEDKL7KtAiPc4F4fDvcgJX3/pete-walls-92JRuvQZfKs-unsplash_crop43.jpg",
    "max_iter": 5,
    "num_infer": 10,
    "resize_input": true,
    "denoise_steps": 10,
    "reduction_method": "median",
    "regularizer_strength": 0.02
  },
  "logs": "multiple inference:   0%|          | 0/1 [00:00<?, ?it/s]\ndenoising:   0%|          | 0/10 [00:00<?, ?it/s]\u001b[A\ndenoising:  10%|█         | 1/10 [00:01<00:13,  1.55s/it]\u001b[A\ndenoising:  20%|██        | 2/10 [00:02<00:09,  1.14s/it]\u001b[A\ndenoising:  30%|███       | 3/10 [00:03<00:07,  1.01s/it]\u001b[A\ndenoising:  40%|████      | 4/10 [00:04<00:05,  1.05it/s]\u001b[A\ndenoising:  50%|█████     | 5/10 [00:04<00:04,  1.09it/s]\u001b[A\ndenoising:  60%|██████    | 6/10 [00:05<00:03,  1.12it/s]\u001b[A\ndenoising:  70%|███████   | 7/10 [00:06<00:02,  1.13it/s]\u001b[A\ndenoising:  80%|████████  | 8/10 [00:07<00:01,  1.14it/s]\u001b[A\ndenoising:  90%|█████████ | 9/10 [00:08<00:00,  1.15it/s]\u001b[A\ndenoising: 100%|██████████| 10/10 [00:09<00:00,  1.15it/s]\u001b[A\n                                                          \u001b[A\nmultiple inference: 100%|██████████| 1/1 [00:09<00:00,  9.34s/it]",
  "metrics": {
    "predict_time": 14.178237,
    "total_time": 14.218021
  },
  "output": [
    "https://replicate.delivery/pbxt/e3ADRhee5YRfZS3BD9DFNycivG5vNhNWtUCm2R6vrtMs0vJIB/depth_bw.png",
    "https://replicate.delivery/pbxt/9ZiEje1wIfiOaU7QdNbhOVH5fuFdPUSdY3dcwnaHfUdw0vJIB/depth_colored.png"
  ],
  "started_at": "2023-12-15T08:24:30.907517Z",
  "status": "succeeded",
  "urls": {
    "get": "https://api.replicate.com/v1/predictions/rwasfmdbcx25iw2kneyh67w2fy",
    "cancel": "https://api.replicate.com/v1/predictions/rwasfmdbcx25iw2kneyh67w2fy/cancel"
  },
  "version": "1a363593bc4882684fc58042d19db5e13a810e44e02f8d4c32afd1eb30464818"
}

Generated in

14.2 seconds

Tweak itReport

Examples

View more examples

Run time and cost

This model costs approximately $0.11 to run on Replicate, or 9 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 114 seconds. The predict time for this model varies significantly based on the inputs.

Readme

Marigold

Marigold is a diffusion model and associated fine-tuning protocol for monocular depth estimation. See the original repository and paper for details.

API Usage

To use the model, simply provide upload the image (ideally RGB or grayscale) you would like to perform depth estimation for. The API returns two depth map images - one grayscale and one spectral.

Input parameters are as follows:
- image: RGB or grayscale input image for the model, use an RGB image for best results.
- resize_input: whether to resize the input image to max resolution of 768, default to True.
- num_infer: number of inferences to be performed. if >1, multiple depth predictions are ensembled. A higher number yields better results but runs slower.
- denoise_steps: number of inference denoising steps, more steps results in higher accuracy but slower inference speed.
- regularizer_strength: ensembling parameter, weight of optimization regularizer.
- reduction_method: ensembling parameter, method to merge aligned depth maps. Choose between ["mean", "medium"].
- max_iter: ensembling parameter, max number of optimization iterations.
- seed: (optional) seed for reproducibility, set to random if left as None.

References

@misc{ke2023repurposing,
      title={Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation}, 
      author={Bingxin Ke and Anton Obukhov and Shengyu Huang and Nando Metzger and Rodrigo Caye Daudt and Konrad Schindler},
      year={2023},
      eprint={2312.02145},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}