cjwbw / daclip-uir

Controlling Vision-Language Models for Universal Image Restoration

  • Public
  • 1.8K runs
  • GitHub
  • Paper
  • License

Run time and cost

This model runs on Nvidia A40 (Large) GPU hardware. Predictions typically complete within 65 seconds. The predict time for this model varies significantly based on the inputs.

Readme

Controlling Vision-Language Models for Universal Image Restoration

Notice!!

🙁 In testing we found that the current pretrained model still struggles with some real-world images, which may have distribution shifts relative to our training dataset (captured with different devices, resolutions, or degradations). We regard this as future work and will try to make the model more practical. We also encourage users who are interested in our work to train their own models on larger datasets with more degradation types.

🙁 We also found that directly resizing input images leads to poor performance on most tasks. We could add a resizing step to training, but interpolation always degrades image quality.
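
Since interpolation is what degrades the inputs, one workaround is to keep the original pixels and only pad the image to a size the network accepts. The snippet below is a minimal sketch of that idea; the pad_to_multiple helper and the stride of 16 are illustrative assumptions, not part of the released scripts.

# Illustrative sketch: pad (rather than interpolate) so both sides are
# divisible by a chosen stride. Original-resolution pixels are preserved.
from PIL import Image
import numpy as np

def pad_to_multiple(img: Image.Image, multiple: int = 16) -> Image.Image:
    """Reflect-pad an image so height and width are divisible by `multiple`."""
    arr = np.asarray(img)
    h, w = arr.shape[:2]
    pad_h = (multiple - h % multiple) % multiple
    pad_w = (multiple - w % multiple) % multiple
    arr = np.pad(arr, ((0, pad_h), (0, pad_w), (0, 0)), mode="reflect")
    return Image.fromarray(arr)

img = Image.open("degraded.png").convert("RGB")
img = pad_to_multiple(img)  # no interpolation, original pixels kept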

🙁 For the inpainting task, the current model only supports face inpainting due to dataset limitations. We provide mask examples, and you can use the generate_masked_face script to generate incomplete (masked) faces, as sketched below.
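
For reference, a masked face can be produced by compositing one of the provided mask examples over a face crop. The snippet below is only an illustrative sketch; the file names and fill value are assumptions, and the actual generate_masked_face script in the repository may differ.

# Illustrative sketch: apply a binary mask example to a face image to create
# an incomplete face for the inpainting task.
from PIL import Image
import numpy as np

face = np.asarray(Image.open("face.png").convert("RGB")).astype(np.float32)
mask = np.asarray(Image.open("mask_example.png").convert("L")) > 127  # True = hole

masked = face.copy()
masked[mask] = 255.0  # fill the hole region with white
Image.fromarray(masked.astype(np.uint8)).save("face_masked.png")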

Overview framework:

(figure: DA-CLIP overview framework)

Acknowledgment: Our DA-CLIP is based on IR-SDE and open_clip. Thanks for their code!
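
For readers unfamiliar with the backbone, the snippet below shows the plain open_clip API that DA-CLIP builds on: encoding an image and a few degradation prompts and comparing them. The model and pretrained-weight names are standard open_clip identifiers used only for illustration; loading the actual DA-CLIP controller checkpoint follows the repository's own scripts.

# Sketch of the open_clip backbone usage (not the DA-CLIP controller itself).
import torch
import open_clip
from PIL import Image

model, _, preprocess = open_clip.create_model_and_transforms(
    "ViT-B-32", pretrained="laion2b_s34b_b79k"
)
tokenizer = open_clip.get_tokenizer("ViT-B-32")

image = preprocess(Image.open("degraded.png")).unsqueeze(0)
text = tokenizer(["noisy", "rainy", "hazy", "low-light"])

with torch.no_grad():
    image_features = model.encode_image(image)
    text_features = model.encode_text(text)
    image_features /= image_features.norm(dim=-1, keepdim=True)
    text_features /= text_features.norm(dim=-1, keepdim=True)
    probs = (100.0 * image_features @ text_features.T).softmax(dim=-1)

print(probs)  # similarity of the image to each degradation prompt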

Contact

If you have any questions, please contact: ziwei.luo@it.uu.se

Citations

If our code helps your research or work, please consider citing our paper. The following is a BibTeX reference:

@article{luo2023controlling,
  title={Controlling Vision-Language Models for Universal Image Restoration},
  author={Luo, Ziwei and Gustafsson, Fredrik K and Zhao, Zheng and Sj{\"o}lund, Jens and Sch{\"o}n, Thomas B},
  journal={arXiv preprint arXiv:2310.01018},
  year={2023}
}