lucataco / clip-interrogator

CLIP Interrogator (for faster inference)

  • Public
  • 101.7K runs
  • GitHub
  • Paper
  • License

Input

Output

Run time and cost

This model runs on Nvidia A40 (Large) GPU hardware. Predictions typically complete within 3 minutes. The predict time for this model varies significantly based on the inputs.

Readme

This is an attempt to replicate the model pharmapsychotic/clip-interrogator to run on an A40 GPU for faster inference times

The CLIP Interrogator is a prompt engineering tool that combines OpenAI’s CLIP and Salesforce’s BLIP to optimize text prompts to match a given image. Use the resulting prompts with text-to-image models like Stable Diffusion to create cool art

Based on the original cog: https://replicate.com/pharmapsychotic/clip-interrogator

Give me a follow if you like my work! @lucataco93