These models generate images from text prompts. Here, you can find the latest state-of-the-art image models that fit your use case.
The best overall image generation model is google/nano-banana-pro. It offers state-of-the-art performance in prompt following, visual quality, text adherence, image detail, and output diversity. This image generation model also has logic built in, allowing you to answer questions with image outputs.
P-image from Pruna AI offers sub-second image generation for less than a cent while still maintaining quality. Additionally, prunaai/z-image-turbo speedily produces stunning image outputs.
Text rendering with google/nano-banana-pro is astonishing and has the ability to handle paragraphs of text.
As a cheaper (and underrated) option, Ideogram models are strong in many areas, but they're especially known for their ability to generate realistic, legible text. See our blog on Ideogram v3.
bytedance/seedream-4.5 is also comparable to nano-banana and is worth giving a shot if you are looking to try a new model.
The Recraft V3 SVG model is the first major text-to-image model with the ability to generate high quality SVG images including logotypes, and icons. The model supports a wide list of styles.
If you’re a fan of ComfyUI, you can export any of your favorite ComfyUI workflows to JSON and run them on Replicate using the fofr/any-comfyui-workflow model. For more information, check out our detailed guide to using ComfyUI.
Make sure to check out our FLUX fine-tunes collection, which includes all publicly available FLUX fine-tunes hosted on Replicate. This collection should help you get a feel for the sorts of things you can do with fine-tuning.
Featured models

Google's state of the art image generation and editing model 🍌🍌
Updated 1 day, 4 hours ago
10M runs

prunaai/z-image-turboZ-Image Turbo is a super fast text-to-image model of 6B parameters developed by Tongyi-MAI.
Updated 2 days, 6 hours ago
12.3M runs

An image generation foundation model in the Qwen series that achieves significant advances in complex text rendering.
Updated 2 days, 10 hours ago
1.4M runs

prunaai/p-imageA sub 1 second text-to-image model built for production use cases.
Updated 1 week, 1 day ago
1.9M runs

Use this fast version of Imagen 4 when speed and cost are more important than quality
Updated 1 month ago
3.8M runs

The highest fidelity image model from Black Forest Labs
Updated 1 month ago
228.8K runs

bytedance/seedream-4.5Seedream 4.5: Upgraded Bytedance image model with stronger spatial understanding and world knowledge
Updated 1 month, 2 weeks ago
2.1M runs

Google's latest image editing model in Gemini 2.5
Updated 1 month, 4 weeks ago
74.3M runs

bytedance/seedream-4Unified text-to-image generation and precise single-sentence editing at up to 4K resolution
Updated 2 months ago
23.6M runs

State-of-the-art image generation with top of the line prompt following, visual quality, image detail and output diversity.
Updated 2 months, 1 week ago
13.8M runs

ideogram-ai/ideogram-v3-turboTurbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles
Updated 2 months, 2 weeks ago
6.6M runs

The fastest image generation model tailored for local development and personal use
Updated 6 months, 4 weeks ago
599.6M runs
Recommended Models
Speed depends on the model’s architecture and how it’s optimized for the hardware it runs on. If you want quick results, models like google/imagen-4-fast and black-forest-labs/flux-schnell are built to return outputs fast, which is great for rapid iteration.
Smaller or “fast” variants usually cost less to run. bytedance/seedream-4.5 and ideogram-ai/ideogram-v3-turbo are good picks if you want solid image quality without spending a lot.
Text-to-image models create a new image from scratch based on your text prompt. Image-to-image models take an existing image and use your prompt to change or build on it. Think of it as “paint something new” vs. “edit what’s already there.”
bytedance/seedream-4.5 and ideogram-ai/ideogram-v3-turbo are great for realistic lighting, textures, and faces. They’re popular for lifelike portraits, product shots, and scenery.
If you’re aiming for a specific look, black-forest-labs/flux-1.1-pro and black-forest-labs/flux-schnell give you more control over style, lighting, and composition. They’re good for illustrations, concept art, or anything with a creative twist.
Yes. Use text-guided editing models like black-forest-labs/flux-kontext-pro or bytedance/seedream-4.5 to add or change details in an existing image. For example, you can tell it to “add sunglasses” or “turn it into a painting.”
Most models output images between 512×512 and 4K. Check the model card for the exact dimensions supported. Higher resolutions can cost more and take a bit longer to run.
Use a reference image or a fixed seed. Models like black-forest-labs/flux-kontext-pro and ideogram-ai/ideogram-v3-turbo support both, so you can keep the same look across multiple runs.
Some models support fine-tuning. Look for the fine-tune tag on the model page or check the README for training details.
Yes. Push a model from GitHub with a replicate.yaml file. Once it’s built, it runs on the same infrastructure as other models.
Check the “License” section on the model page. Some licenses allow commercial use, others don’t. Always make sure before using outputs in anything public or commercial.
Recommended Models

Google's Imagen 4 flagship model
Updated 1 month ago
7.2M runs

Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty
Updated 1 month ago
1.9M runs

A faster and cheaper Imagen 3 model, for when price or speed are more important than final image quality
Updated 1 month ago
544.8K runs

Use this ultra version of Imagen 4 when quality matters more than speed and cost
Updated 1 month ago
1.3M runs

prunaai/hidream-l1-fastThis is an optimised version of the hidream-l1 model using the pruna ai optimisation toolkit!
Updated 1 month, 1 week ago
7.7M runs

Commercial-ready, trained entirely on licensed data, text-to-image model. With only 4B parameters provides exceptional aesthetics and text rendering. Evaluated to be on par to other leading models in the market
Updated 1 month, 2 weeks ago
137.4K runs

SOTA Open source model trained on licensed data, transforming intent into structured control for precise, high-quality AI image generation in enterprise and agentic workflows.
Updated 1 month, 2 weeks ago
10K runs

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts
Updated 2 months, 1 week ago
9.6M runs

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language
Updated 2 months, 1 week ago
43.7M runs

FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.
Updated 2 months, 1 week ago
19.9M runs

bytedance/seedream-3A text-to-image model with support for native high-resolution (2K) image generation
Updated 2 months, 1 week ago
3.2M runs

prunaai/flux-fastThis is the fastest Flux endpoint in the world.
Updated 2 months, 2 weeks ago
37.9M runs

Artistic and high-quality visuals with improved prompt adherence, diversity, and definition
Updated 2 months, 2 weeks ago
222.5K runs

recraft-ai/recraft-v3Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis
Updated 2 months, 2 weeks ago
7.6M runs

recraft-ai/recraft-v3-svgRecraft V3 SVG (code-named red_panda) is a text-to-image model with the ability to generate high quality SVG images including logotypes, and icons. The model supports a wide list of styles.
Updated 2 months, 2 weeks ago
350.7K runs

ideogram-ai/ideogram-v2a-turboLike Ideogram v2 turbo, but now faster and cheaper
Updated 2 months, 2 weeks ago
379.2K runs

ideogram-ai/ideogram-v2An excellent image model with state of the art inpainting, prompt comprehension and text rendering
Updated 2 months, 2 weeks ago
2.6M runs

ideogram-ai/ideogram-v3-qualityThe highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles
Updated 2 months, 2 weeks ago
2.1M runs

ideogram-ai/ideogram-v2aLike Ideogram v2, but faster and cheaper
Updated 2 months, 2 weeks ago
2M runs

ideogram-ai/ideogram-v3-balancedBalance speed, quality and cost. Ideogram v3 creates images with stunning realism, creative designs, and consistent styles
Updated 2 months, 2 weeks ago
364.9K runs

ideogram-ai/ideogram-v2-turboA fast image model with state of the art inpainting, prompt comprehension and text rendering.
Updated 2 months, 2 weeks ago
2.9M runs

stability-ai/stable-diffusion-3.5-medium2.5 billion parameter image model with improved MMDiT-X architecture
Updated 2 months, 2 weeks ago
99.7K runs

stability-ai/stable-diffusion-3.5-largeA text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, thanks to Query-Key Normalization.
Updated 2 months, 2 weeks ago
1.9M runs

stability-ai/stable-diffusion-3.5-large-turboA text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, with a focus on fewer inference steps
Updated 2 months, 2 weeks ago
885.6K runs

Minimax's first image model, with character reference support
Updated 2 months, 2 weeks ago
2.5M runs

luma/photon-flashAccelerated variant of Photon prioritizing speed while maintaining quality
Updated 2 months, 2 weeks ago
250.3K runs

Run any ComfyUI workflow. Guide: https://github.com/replicate/cog-comfyui
Updated 2 months, 2 weeks ago
7.5M runs

tencent/hunyuan-image-3A powerful native multimodal model for image generation (PrunaAI squeezed)
Updated 3 months, 2 weeks ago
45.4K runs

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
Updated 5 months, 4 weeks ago
1.1M runs

prunaai/wan-2.2-imageThis model generates beautiful cinematic 2 megapixel images in 3-4 seconds and is derived from the Wan 2.2 model through optimisation techniques from the pruna package
Updated 6 months ago
992K runs

prunaai/hidream-l1-fullThis is an optimised version of the hidream-full model using the pruna ai optimisation toolkit!
Updated 6 months, 1 week ago
33.7K runs

prunaai/hidream-l1-devThis is an optimised version of the hidream-l1-dev model using the pruna ai optimisation toolkit!
Updated 6 months, 1 week ago
47.9K runs

A version of flux-dev, a text to image model, that supports fast fine-tuned lora inference
Updated 6 months, 4 weeks ago
5.5M runs

A 12 billion parameter rectified flow transformer capable of generating images from text descriptions
Updated 6 months, 4 weeks ago
38.3M runs

prunaai/sdxl-lightningThis is the fastest sdxl-lightning endpoint in the world on A100, contact us for more at pruna.ai
Updated 7 months, 1 week ago
3.4K runs

bytedance/sdxl-lightning-4stepSDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps
Updated 10 months ago
1B runs

A fast image model with wide artistic range and resolutions up to 4096x4096
Updated 1 year, 1 month ago
231.5K runs

luma/photonHigh-quality image generation model optimized for creative professional workflows and ultra-high fidelity outputs
Updated 1 year, 1 month ago
3.2M runs

stability-ai/sdxlA text-to-image generative AI model that creates beautiful images
Updated 1 year, 7 months ago
83.7M runs

fofr/sticker-makerMake stickers with AI. Generates graphics with transparent backgrounds.
Updated 1 year, 8 months ago
1.9M runs

ai-forever/kandinsky-2text2img model trained on LAION HighRes and fine-tuned on internal datasets
Updated 1 year, 9 months ago
6.2M runs

ai-forever/kandinsky-2.2multilingual text2image latent diffusion model
Updated 1 year, 9 months ago
10M runs

playgroundai/playground-v2.5-1024px-aestheticPlayground v2.5 is the state-of-the-art open-source model in aesthetic quality
Updated 1 year, 10 months ago
2.9M runs

datacte/proteus-v0.3ProteusV0.3: The Anime Update
Updated 1 year, 11 months ago
5.2M runs

'''Last update: Now supports img2img.''' SDXL Canny controlnet with LoRA support.
Updated 1 year, 11 months ago
991K runs

datacte/proteus-v0.2Proteus v0.2 shows subtle yet significant improvements over Version 0.1. It demonstrates enhanced prompt understanding that surpasses MJ6, while also approaching its stylistic capabilities.
Updated 1 year, 11 months ago
11.6M runs

adirik/realvisxl-v3.0-turboPhotorealism with RealVisXL V3.0 Turbo based on SDXL
Updated 2 years ago
596.8K runs

fofr/latent-consistency-modelSuper-fast, 0.6s per image. LCM with img2img, large batching and canny controlnet
Updated 2 years ago
1.5M runs

RealVisXl V3 with multi-controlnet, lora loading, img2img, inpainting
Updated 2 years ago
2M runs

lucataco/open-dalle-v1.1A unique fusion that showcases exceptional prompt adherence and semantic understanding, it seems to be a step above base SDXL and a step closer to DALLE-3 in terms of prompt comprehension
Updated 2 years ago
132.6K runs

fofr/sdxl-multi-controlnet-loraMulti-controlnet, lora loading, img2img, inpainting
Updated 2 years, 1 month ago
216.4K runs

lucataco/dreamshaper-xl-turboDreamShaper is a general purpose SD model that aims at doing everything well, photos, art, anime, manga. It's designed to match Midjourney and DALL-E.
Updated 2 years, 1 month ago
228.7K runs

lucataco/ssd-1bSegmind Stable Diffusion Model (SSD-1B) is a distilled 50% smaller version of SDXL, offering a 60% speedup while maintaining high-quality text-to-image generation capabilities
Updated 2 years, 2 months ago
1M runs

fofr/sdxl-emojiAn SDXL fine-tune based on Apple Emojis
Updated 2 years, 4 months ago
11.6M runs

lucataco/realistic-vision-v5.1Implementation of Realistic Vision v5.1 with VAE
Updated 2 years, 5 months ago
4.3M runs

stability-ai/stable-diffusionA latent text-to-image diffusion model capable of generating photo-realistic images given any text input
Updated 2 years, 6 months ago
110.9M runs

jagilley/controlnet-scribbleGenerate detailed images from scribbled drawings
Updated 2 years, 11 months ago
38.3M runs

tstramer/material-diffusionStable diffusion fork for generating tileable outputs using v1.5 model
Updated 3 years, 2 months ago
2.4M runs