These models generate images from text prompts. Many of these models are based on Stable Diffusion and FLUX.1.
Learn more about the latest FLUX.1 Kontext.
The best overall image generation model is bytedance/seedream-3. It offers state-of-the-art performance in prompt following, visual quality, image detail, and output diversity.
Use this fast version of black-forest-labs/flux-schnell when speed and cost are more important than quality
Ideogram models are strong in many areas, but they're especially known for their ability to generate realistic, legible text. See our blog on Ideogram v3.
The Recraft V3 SVG model is the first major text-to-image model with the ability to generate high quality SVG images including logotypes, and icons. The model supports a wide list of styles.
If you’re a fan of ComfyUI, you can export any of your favorite ComfyUI workflows to JSON and run them on Replicate using the fofr/any-comfyui-workflow model. For more information, check out our detailed guide to using ComfyUI.
Make sure to check out our FLUX fine-tunes collection, which includes all publicly available FLUX fine-tunes hosted on Replicate. This collection should help you get a feel for the sorts of things you can do with fine-tuning.
Featured models

Use this fast version of Imagen 4 when speed and cost are more important than quality
Updated 1 week ago
2.3M runs

bytedance/seedream-4Unified text-to-image generation and precise single-sentence editing at up to 4K resolution
Updated 1 week, 5 days ago
15.6M runs

Faster, better FLUX Pro. Text-to-image model with excellent image quality, prompt adherence, and output diversity.
Updated 3 weeks, 1 day ago
64.7M runs

ideogram-ai/ideogram-v3-turboTurbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles
Updated 3 weeks, 4 days ago
4.9M runs

An image generation foundation model in the Qwen series that achieves significant advances in complex text rendering.
Updated 2 months, 3 weeks ago
1.1M runs

The fastest image generation model tailored for local development and personal use
Updated 5 months, 1 week ago
559.9M runs
Recommended Models
Speed depends on the model’s architecture and how it’s optimized for the hardware it runs on. If you want quick results, models like google/imagen-4-fast and black-forest-labs/flux-schnell are built to return outputs fast, which is great for rapid iteration.
Smaller or “fast” variants usually cost less to run. bytedance/seedream-4 and ideogram-ai/ideogram-v3-turbo are good picks if you want solid image quality without spending a lot.
Text-to-image models create a new image from scratch based on your text prompt. Image-to-image models take an existing image and use your prompt to change or build on it. Think of it as “paint something new” vs. “edit what’s already there.”
bytedance/seedream-4 and ideogram-ai/ideogram-v3-turbo are great for realistic lighting, textures, and faces. They’re popular for lifelike portraits, product shots, and scenery.
If you’re aiming for a specific look, black-forest-labs/flux-1.1-pro and black-forest-labs/flux-schnell give you more control over style, lighting, and composition. They’re good for illustrations, concept art, or anything with a creative twist.
Yes. Use text-guided editing models like black-forest-labs/flux-kontext-pro or bytedance/seedream-4 to add or change details in an existing image. For example, you can tell it to “add sunglasses” or “turn it into a painting.”
Most models output images between 512×512 and 4K. Check the model card for the exact dimensions supported. Higher resolutions can cost more and take a bit longer to run.
Use a reference image or a fixed seed. Models like black-forest-labs/flux-kontext-pro and ideogram-ai/ideogram-v3-turbo support both, so you can keep the same look across multiple runs.
Some models support fine-tuning. Look for the fine-tune tag on the model page or check the README for training details.
Yes. Push a model from GitHub with a replicate.yaml file. Once it’s built, it runs on the same infrastructure as other models.
Check the “License” section on the model page. Some licenses allow commercial use, others don’t. Always make sure before using outputs in anything public or commercial.
Recommended Models

Google's latest image editing model in Gemini 2.5
Updated 1 week ago
48.4M runs

Google's Imagen 4 flagship model
Updated 1 week ago
6.2M runs

Use this ultra version of Imagen 4 when quality matters more than speed and cost
Updated 1 week ago
1.2M runs

A faster and cheaper Imagen 3 model, for when price or speed are more important than final image quality
Updated 1 week ago
511.4K runs

Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty
Updated 1 week ago
1.9M runs

Commercial-ready, trained entirely on licensed data, text-to-image model. With only 4B parameters provides exceptional aesthetics and text rendering. Evaluated to be on par to other leading models in the market
Updated 1 week, 1 day ago
102.6K runs

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts
Updated 2 weeks, 6 days ago
9M runs

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language
Updated 2 weeks, 6 days ago
38M runs

FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.
Updated 3 weeks, 1 day ago
19.2M runs

State-of-the-art image generation with top of the line prompt following, visual quality, image detail and output diversity.
Updated 3 weeks, 1 day ago
13.7M runs

bytedance/seedream-3A text-to-image model with support for native high-resolution (2K) image generation
Updated 3 weeks, 1 day ago
3M runs

prunaai/hidream-l1-fastThis is an optimised version of the hidream-l1 model using the pruna ai optimisation toolkit!
Updated 3 weeks, 3 days ago
5.8M runs

prunaai/flux-fastThis is the fastest Flux endpoint in the world.
Updated 3 weeks, 3 days ago
36M runs

Artistic and high-quality visuals with improved prompt adherence, diversity, and definition
Updated 3 weeks, 4 days ago
168.6K runs

recraft-ai/recraft-v3Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis
Updated 3 weeks, 4 days ago
7.1M runs

recraft-ai/recraft-v3-svgRecraft V3 SVG (code-named red_panda) is a text-to-image model with the ability to generate high quality SVG images including logotypes, and icons. The model supports a wide list of styles.
Updated 3 weeks, 4 days ago
324.5K runs

ideogram-ai/ideogram-v2a-turboLike Ideogram v2 turbo, but now faster and cheaper
Updated 3 weeks, 4 days ago
369.1K runs

ideogram-ai/ideogram-v2An excellent image model with state of the art inpainting, prompt comprehension and text rendering
Updated 3 weeks, 4 days ago
2.6M runs

ideogram-ai/ideogram-v3-qualityThe highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles
Updated 3 weeks, 4 days ago
2M runs

ideogram-ai/ideogram-v2aLike Ideogram v2, but faster and cheaper
Updated 3 weeks, 4 days ago
2M runs

ideogram-ai/ideogram-v3-balancedBalance speed, quality and cost. Ideogram v3 creates images with stunning realism, creative designs, and consistent styles
Updated 3 weeks, 4 days ago
336.4K runs

ideogram-ai/ideogram-v2-turboA fast image model with state of the art inpainting, prompt comprehension and text rendering.
Updated 3 weeks, 4 days ago
2.8M runs

SOTA Open source model trained on licensed data, transforming intent into structured control for precise, high-quality AI image generation in enterprise and agentic workflows.
Updated 3 weeks, 4 days ago
2.2K runs

stability-ai/stable-diffusion-3.5-medium2.5 billion parameter image model with improved MMDiT-X architecture
Updated 3 weeks, 4 days ago
88.2K runs

stability-ai/stable-diffusion-3.5-largeA text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, thanks to Query-Key Normalization.
Updated 3 weeks, 4 days ago
1.8M runs

stability-ai/stable-diffusion-3.5-large-turboA text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, with a focus on fewer inference steps
Updated 3 weeks, 4 days ago
833.8K runs

Minimax's first image model, with character reference support
Updated 3 weeks, 4 days ago
2.1M runs

luma/photon-flashAccelerated variant of Photon prioritizing speed while maintaining quality
Updated 3 weeks, 4 days ago
191.6K runs

Run any ComfyUI workflow. Guide: https://github.com/replicate/cog-comfyui
Updated 4 weeks ago
7M runs

tencent/hunyuan-image-3A powerful native multimodal model for image generation (PrunaAI squeezed)
Updated 1 month, 3 weeks ago
31.1K runs

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
Updated 4 months, 1 week ago
904.4K runs

prunaai/wan-2.2-imageThis model generates beautiful cinematic 2 megapixel images in 3-4 seconds and is derived from the Wan 2.2 model through optimisation techniques from the pruna package
Updated 4 months, 2 weeks ago
828.7K runs

prunaai/hidream-l1-fullThis is an optimised version of the hidream-full model using the pruna ai optimisation toolkit!
Updated 4 months, 2 weeks ago
32.7K runs

prunaai/hidream-l1-devThis is an optimised version of the hidream-l1-dev model using the pruna ai optimisation toolkit!
Updated 4 months, 2 weeks ago
47.2K runs

A version of flux-dev, a text to image model, that supports fast fine-tuned lora inference
Updated 5 months, 1 week ago
5.2M runs

A 12 billion parameter rectified flow transformer capable of generating images from text descriptions
Updated 5 months, 1 week ago
33.4M runs

prunaai/sdxl-lightningThis is the fastest sdxl-lightning endpoint in the world on A100, contact us for more at pruna.ai
Updated 5 months, 3 weeks ago
3.2K runs

bytedance/sdxl-lightning-4stepSDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps
Updated 8 months, 2 weeks ago
1B runs

A fast image model with wide artistic range and resolutions up to 4096x4096
Updated 11 months, 3 weeks ago
219.2K runs

luma/photonHigh-quality image generation model optimized for creative professional workflows and ultra-high fidelity outputs
Updated 11 months, 3 weeks ago
3.1M runs

stability-ai/sdxlA text-to-image generative AI model that creates beautiful images
Updated 1 year, 6 months ago
82.9M runs

fofr/sticker-makerMake stickers with AI. Generates graphics with transparent backgrounds.
Updated 1 year, 7 months ago
1.8M runs

ai-forever/kandinsky-2text2img model trained on LAION HighRes and fine-tuned on internal datasets
Updated 1 year, 7 months ago
6.2M runs

ai-forever/kandinsky-2.2multilingual text2image latent diffusion model
Updated 1 year, 7 months ago
10M runs

playgroundai/playground-v2.5-1024px-aestheticPlayground v2.5 is the state-of-the-art open-source model in aesthetic quality
Updated 1 year, 8 months ago
2.8M runs

datacte/proteus-v0.3ProteusV0.3: The Anime Update
Updated 1 year, 9 months ago
4.9M runs

fermatresearch/sdxl-controlnet-lora'''Last update: Now supports img2img.''' SDXL Canny controlnet with LoRA support.
Updated 1 year, 10 months ago
979.4K runs

datacte/proteus-v0.2Proteus v0.2 shows subtle yet significant improvements over Version 0.1. It demonstrates enhanced prompt understanding that surpasses MJ6, while also approaching its stylistic capabilities.
Updated 1 year, 10 months ago
11.2M runs

adirik/realvisxl-v3.0-turboPhotorealism with RealVisXL V3.0 Turbo based on SDXL
Updated 1 year, 10 months ago
549.2K runs

fofr/latent-consistency-modelSuper-fast, 0.6s per image. LCM with img2img, large batching and canny controlnet
Updated 1 year, 10 months ago
1.5M runs

RealVisXl V3 with multi-controlnet, lora loading, img2img, inpainting
Updated 1 year, 10 months ago
2M runs

lucataco/open-dalle-v1.1A unique fusion that showcases exceptional prompt adherence and semantic understanding, it seems to be a step above base SDXL and a step closer to DALLE-3 in terms of prompt comprehension
Updated 1 year, 11 months ago
132.5K runs

fofr/sdxl-multi-controlnet-loraMulti-controlnet, lora loading, img2img, inpainting
Updated 1 year, 11 months ago
215.1K runs

lucataco/dreamshaper-xl-turboDreamShaper is a general purpose SD model that aims at doing everything well, photos, art, anime, manga. It's designed to match Midjourney and DALL-E.
Updated 1 year, 11 months ago
226.8K runs

lucataco/ssd-1bSegmind Stable Diffusion Model (SSD-1B) is a distilled 50% smaller version of SDXL, offering a 60% speedup while maintaining high-quality text-to-image generation capabilities
Updated 2 years ago
1M runs

fofr/sdxl-emojiAn SDXL fine-tune based on Apple Emojis
Updated 2 years, 2 months ago
11.4M runs

lucataco/realistic-vision-v5.1Implementation of Realistic Vision v5.1 with VAE
Updated 2 years, 3 months ago
4.3M runs

stability-ai/stable-diffusionA latent text-to-image diffusion model capable of generating photo-realistic images given any text input
Updated 2 years, 4 months ago
110.8M runs

jagilley/controlnet-scribbleGenerate detailed images from scribbled drawings
Updated 2 years, 9 months ago
38.3M runs

tstramer/material-diffusionStable diffusion fork for generating tileable outputs using v1.5 model
Updated 3 years ago
2.4M runs