Predictions run on Nvidia T4 GPU hardware. Predictions typically complete within 109 seconds. The predict time for this model varies significantly based on the inputs.
erlich is the text2image latent diffusion model from CompVis (with additions from
glid-3-xl) finetuned on a dataset collected from LAION-5B named Large Logo Dataset. It consists of roughly 100K images of logos with captions generated via BLIP using aggressive re-ranking.
For more info see the README.md for