pharmapsychotic / clip-interrogator

The CLIP Interrogator is a prompt engineering tool that combines OpenAI's CLIP and Salesforce's BLIP to optimize text prompts to match a given image. Use the resulting prompts with text-to-image models like Stable Diffusion to create cool art!

  • Public
  • 3.2M runs
  • T4

Input

image (file, required)

Input image

clip_model_name (string)

Choose ViT-L for Stable Diffusion 1, ViT-H for Stable Diffusion 2, or ViT-bigG for Stable Diffusion XL.

Default: "ViT-L-14/openai"

mode (string)

Prompt mode (best takes 10-20 seconds, fast takes 1-2 seconds).

Default: "best"

Output

a watercolor painting of a sea turtle, a digital painting, by Kubisi art, featured on dribbble, medibang, warm saturated palette, red and green tones, turquoise horizon, digital art h 9 6 0, detailed scenery —width 672, illustration:.4, spray art, artstatiom

This example was created by a different version, pharmapsychotic/clip-interrogator:41fdb702.

Run time and cost

This model costs approximately $0.00068 to run on Replicate, or about 1470 runs per $1, though this varies depending on your inputs. It is also open source, so you can run it on your own computer with Docker.

This model runs on Nvidia T4 GPU hardware. Predictions typically complete within 4 seconds.
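
If you prefer to skip Docker, the same tool is published as the clip-interrogator package on PyPI; a minimal local run might look like the sketch below (class and method names are taken from the upstream project, but check its README for the current interface).

```python
# pip install clip-interrogator
from PIL import Image
from clip_interrogator import Config, Interrogator

# Load the same CLIP model the hosted version defaults to.
ci = Interrogator(Config(clip_model_name="ViT-L-14/openai"))

image = Image.open("sea_turtle.jpg").convert("RGB")  # placeholder image path
print(ci.interrogate(image))  # "best" mode; interrogate_fast() is the quick variant
```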

Readme

The CLIP Interrogator uses the OpenAI CLIP models to test a given image against a variety of artists, mediums, and styles to study how the different models see the content of the image. It then combines those results with a BLIP-generated caption to suggest a text prompt for creating more images similar to the one given.
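
Conceptually, the ranking step works like the sketch below: embed the image and a bank of candidate phrases with CLIP, then keep the phrases whose embeddings are most similar to the image's. This is an illustrative reduction using the open_clip library, not the project's actual code, and the tiny candidate list is made up; the real tool scores large lists of artists, mediums, and style modifiers.

```python
import torch
import open_clip
from PIL import Image

# Load a CLIP model from the same family the hosted model defaults to.
model, _, preprocess = open_clip.create_model_and_transforms("ViT-L-14", pretrained="openai")
tokenizer = open_clip.get_tokenizer("ViT-L-14")

# A tiny, made-up candidate bank for illustration only.
candidates = ["a watercolor painting", "an oil painting", "a 3d render", "pixel art"]

image = preprocess(Image.open("sea_turtle.jpg")).unsqueeze(0)  # placeholder image
text = tokenizer(candidates)

with torch.no_grad():
    img_emb = model.encode_image(image)
    txt_emb = model.encode_text(text)
    # Cosine similarity between the image and each candidate phrase.
    img_emb /= img_emb.norm(dim=-1, keepdim=True)
    txt_emb /= txt_emb.norm(dim=-1, keepdim=True)
    sims = (img_emb @ txt_emb.T).squeeze(0)

best = sims.argmax().item()
# The top-ranked phrase would be appended to the BLIP caption.
print(candidates[best], float(sims[best]))
```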