These models generate images from text prompts. Here, you can find the latest state-of-the-art image models that fit your use case.
GPT Image 1.5 from OpenAI is the strongest all-around image generation model right now. It follows complex prompts accurately, renders readable text, and handles everything from photorealistic scenes to infographics and UI mockups. It also works as an image editor — make targeted changes while preserving everything else. Requires an OpenAI API key.
Nano Banana Pro from Google is close behind — it reasons about your prompt using Gemini 3 Pro, renders multilingual text accurately, and can blend up to 14 input images into a single composition. It connects to Google Search for real-time information, which makes it especially good for data-driven visuals and infographics. Supports up to 4K output.
FLUX.2 Max from Black Forest Labs delivers the highest fidelity in the FLUX family. It excels at product photography, character consistency across batches (up to 8 reference images), and precise color control via hex codes. Great for e-commerce, fashion, and any workflow where consistency matters.
FLUX.2 Pro offers similar capabilities at a lower price. It supports structured JSON prompting for precise control over camera angle, lighting, and composition, and handles up to 8 reference images. A good choice for high-volume production work.
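As a rough illustration of structured JSON prompting, the sketch below builds a prompt object in Python and serializes it to a JSON string. The field names (subject, camera_angle, lighting, composition) are hypothetical and chosen only to mirror the controls mentioned above; check the model's README for the schema it actually expects.

```python
import json


def build_structured_prompt(subject, camera_angle, lighting, composition):
    """Return a JSON string describing the scene.

    The keys used here are illustrative assumptions, not a documented schema.
    """
    prompt = {
        "subject": subject,
        "camera_angle": camera_angle,
        "lighting": lighting,
        "composition": composition,
    }
    # sort_keys keeps the output stable, which helps when caching or diffing prompts.
    return json.dumps(prompt, sort_keys=True)


prompt_json = build_structured_prompt(
    subject="ceramic coffee mug on a walnut table",
    camera_angle="three-quarter view, slightly above",
    lighting="soft window light from the left",
    composition="subject centered, shallow depth of field",
)
print(prompt_json)
```

The resulting string would then be passed as the text prompt. Keeping each control in its own field makes it easy to vary one setting, such as lighting, while holding the rest fixed.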
Seedream 4 from ByteDance combines generation and editing in one model. It produces images at up to 4K resolution with fast inference, and supports batch outputs and multi-reference input. Strong at style transfer — watercolor, cyberpunk, architectural, and more.
Seedream 4.5 upgrades Seedream 4 with cinematic aesthetics, stronger spatial understanding, and richer world knowledge. It produces film-like visuals with refined lighting and shading, and is particularly good at realistic proportions and structured environments.
FLUX.2 Flex is the typography specialist in the FLUX family. It reliably renders clean text, captions, and complex layouts — perfect for memes, posters, infographics, and UI mockups. You can adjust the quality-speed trade-off by changing the number of steps, making it great for rapid iteration. Supports up to 10 reference images.
Ideogram v3 is built for graphic design and branding. It generates precise text, supports style references (upload up to 3 images or use 4.3 billion style presets), and produces clean layouts for logos, posters, and marketing materials. Available in Turbo, Balanced, and Quality tiers.
Recraft V4 takes a design-first approach — every output feels art-directed rather than generic. Strong integrated text rendering, intentional composition, and refined color relationships. Good for brand assets, editorial photography, and print-ready work.
Recraft V4 SVG generates native, editable SVG vector files, not traced rasters. Output opens directly in Illustrator, Figma, or Sketch with clean paths and structured layers. Recraft's SVG models are the only ones in this collection that produce true vector output. Use it for logos, icons, illustrations, and any asset that needs to scale.
Imagen 4 Fast and FLUX Schnell are built for quick iteration — use them when you need fast results at lower cost.
Ideogram v3 Turbo gives you solid image quality with good text rendering at $0.03 per image.
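To put the per-image price in context, here is a small sketch that estimates the cost of a batch. The $0.03 figure is the one quoted above; the batch size of 500 is just an example, and actual billing may differ.

```python
from decimal import Decimal


def batch_cost(price_per_image, image_count):
    """Total cost in dollars for a batch, using Decimal to avoid float rounding."""
    return Decimal(price_per_image) * image_count


# $0.03 per image is the Ideogram v3 Turbo price quoted above.
total = batch_cost("0.03", 500)
print(f"500 images cost ${total}")  # prints: 500 images cost $15.00
```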
Compare models side by side in the playground to find what works best for your project.
Questions? Join us on Discord.
Featured models

High-quality image generation and editing with support for eight reference images
Updated 1 day, 7 hours ago
4.4M runs

Max-quality image generation and editing with support for ten reference images
Updated 1 week, 1 day ago
196.8K runs

The highest fidelity image model from Black Forest Labs
Updated 1 week, 1 day ago
1.2M runs

Google's state of the art image generation and editing model 🍌🍌
Updated 3 weeks, 6 days ago
18.2M runs

openai/gpt-image-1.5
OpenAI's latest image generation model with better instruction following and adherence to prompts
Updated 2 months ago
6.3M runs

bytedance/seedream-4
Unified text-to-image generation and precise single-sentence editing at up to 4K resolution
Updated 4 months ago
30.5M runs
Recommended Models
Speed depends on the model’s architecture and how it’s optimized for the hardware it runs on. If you want quick results, models like google/imagen-4-fast and black-forest-labs/flux-schnell are built to return outputs fast, which is great for rapid iteration.
Smaller or “fast” variants usually cost less to run. bytedance/seedream-4.5 and ideogram-ai/ideogram-v3-turbo are good picks if you want solid image quality without spending a lot.
Text-to-image models create a new image from scratch based on your text prompt. Image-to-image models take an existing image and use your prompt to change or build on it. Think of it as “paint something new” vs. “edit what’s already there.”
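The difference between the two modes often comes down to whether the request includes a source image. The sketch below shows one way to build both kinds of input. The "prompt" and "image" keys are assumptions, since input names vary between models; run_model uses the Replicate Python client's replicate.run call and needs the replicate package and an API token, so it is defined here but not called.

```python
def build_input(prompt, image_url=None):
    """Build an input payload.

    Without image_url this is a text-to-image request; with it, image-to-image.
    The "prompt" and "image" keys are assumed names; check the model's schema.
    """
    payload = {"prompt": prompt}
    if image_url is not None:
        payload["image"] = image_url
    return payload


def run_model(model, payload):
    """Send the payload to a model through the Replicate Python client."""
    import replicate  # imported lazily so build_input works without the package

    return replicate.run(model, input=payload)


# "Paint something new": prompt only.
text_to_image = build_input("a lighthouse at dusk, oil painting")

# "Edit what's already there": prompt plus an existing image (placeholder URL).
image_to_image = build_input(
    "turn it into a painting",
    image_url="https://example.com/photo.jpg",
)
```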
bytedance/seedream-4.5 and ideogram-ai/ideogram-v3-turbo are great for realistic lighting, textures, and faces. They’re popular for lifelike portraits, product shots, and scenery.
If you’re aiming for a specific look, black-forest-labs/flux-1.1-pro and black-forest-labs/flux-schnell give you more control over style, lighting, and composition. They’re good for illustrations, concept art, or anything with a creative twist.
You can edit existing images with text-guided editing models like black-forest-labs/flux-kontext-pro or bytedance/seedream-4.5, which add or change details in an image you supply. For example, you can tell the model to “add sunglasses” or “turn it into a painting.”
Most models output images between 512×512 and 4K. Check the model card for the exact dimensions supported. Higher resolutions can cost more and take a bit longer to run.
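To see why higher resolutions tend to cost more and run longer, it helps to compare pixel counts. The sketch below assumes "4K" means 4096×4096, which is only one possible interpretation; the model card has the real dimensions.

```python
def pixel_count(width, height):
    """Number of pixels in an image of the given size."""
    return width * height


def megapixels(width, height):
    """Image size in megapixels (millions of pixels)."""
    return pixel_count(width, height) / 1_000_000


small = (512, 512)
large = (4096, 4096)  # assumed stand-in for "4K"

# Integer division is exact here because 4096 is a multiple of 512.
ratio = pixel_count(*large) // pixel_count(*small)

print(f"{small[0]}x{small[1]}: {megapixels(*small):.2f} MP")
print(f"{large[0]}x{large[1]}: {megapixels(*large):.2f} MP")
print(f"The larger image has {ratio}x as many pixels")
```

Under that assumption the larger image has 64 times as many pixels to generate, which is a reasonable first guess at why it takes more compute.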
To get consistent results, use a reference image or a fixed seed. Models like black-forest-labs/flux-kontext-pro and ideogram-ai/ideogram-v3-turbo support both, so you can keep the same look across multiple runs.
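A minimal sketch of the fixed-seed approach: every request in a series reuses the same seed and, optionally, the same reference image. The "seed" and "reference_image" keys are hypothetical names for illustration; each model documents its own input names.

```python
def build_seeded_inputs(prompts, seed, reference_image=None):
    """Return one payload per prompt, all sharing a seed and optional reference.

    "seed" and "reference_image" are assumed key names, not a documented schema.
    """
    payloads = []
    for prompt in prompts:
        payload = {"prompt": prompt, "seed": seed}
        if reference_image is not None:
            payload["reference_image"] = reference_image
        payloads.append(payload)
    return payloads


series = build_seeded_inputs(
    prompts=[
        "a red fox mascot waving",
        "a red fox mascot reading a book",
    ],
    seed=1234,
    reference_image="https://example.com/fox-reference.png",  # placeholder URL
)

for payload in series:
    print(payload)
```

Changing only the prompt while holding the seed and reference constant makes it easier to tell which differences come from the prompt.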
Some models support fine-tuning. Look for the fine-tune tag on the model page or check the README for training details.
You can also run your own model. Package it with Cog (a cog.yaml file plus a predictor) and push it to Replicate. Once it’s built, it runs on the same infrastructure as other models.
Check the “License” section on the model page. Some licenses allow commercial use, others don’t. Always make sure before using outputs in anything public or commercial.
Recommended Models

prunaai/z-image-turbo
Z-Image Turbo is a super fast text-to-image model of 6B parameters developed by Tongyi-MAI.
Updated 3 weeks, 4 days ago
35.9M runs

prunaai/p-image
A sub 1 second text-to-image model built for production use cases.
Updated 3 weeks, 6 days ago
6.9M runs

bytedance/seedream-5-lite
Seedream 5.0 lite: image generation with built-in reasoning, example-based editing, and deep domain knowledge
Updated 1 month ago
362.2K runs

recraft-ai/recraft-v4-pro-svg
Generate detailed SVG vector graphics from text prompts. Recraft V4 Pro's design taste with more geometric detail and finer paths: clean layers, editable output, and scalable to any size.
Updated 1 month ago
3.4K runs

recraft-ai/recraft-v4-pro
Recraft's latest image generation model at ~2048px resolution. Same design taste and prompt accuracy as V4, with higher resolution for print-ready and large-scale work.
Updated 1 month ago
6.3K runs

recraft-ai/recraft-v4
Recraft's latest image generation model, built around design taste. Strong prompt accuracy, art-directed composition, and integrated text rendering. Fast and cost-efficient at standard resolution.
Updated 1 month ago
108.4K runs

recraft-ai/recraft-v4-svg
Generate production-ready SVG vector images from text prompts. Recraft V4's design taste applied to vector output: clean geometry, structured layers, and editable paths.
Updated 1 month ago
8.3K runs

prunaai/p-image-lora
Use LoRAs trained with the trainer at https://replicate.com/prunaai/p-image-trainer. Find or contribute LoRAs at https://huggingface.co/collections/PrunaAI/p-image-loras
Updated 1 month, 1 week ago
4.6K runs

prunaai/hidream-l1-fast
This is an optimised version of the hidream-l1 model using the pruna ai optimisation toolkit!
Updated 1 month, 1 week ago
8.2M runs

Google's Imagen 4 flagship model
Updated 1 month, 1 week ago
7.9M runs

Use this ultra version of Imagen 4 when quality matters more than speed and cost
Updated 1 month, 1 week ago
1.6M runs

Use this fast version of Imagen 4 when speed and cost are more important than quality
Updated 1 month, 1 week ago
4.8M runs

Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty
Updated 1 month, 1 week ago
2M runs

A faster and cheaper Imagen 3 model, for when price or speed are more important than final image quality
Updated 1 month, 1 week ago
589.2K runs

SOTA image model from xAI
Updated 1 month, 1 week ago
250.6K runs

Google's latest image editing model in Gemini 2.5
Updated 1 month, 2 weeks ago
94.3M runs

SOTA open-source model trained on licensed data, transforming intent into structured control for precise, high-quality AI image generation in enterprise and agentic workflows.
Updated 1 month, 2 weeks ago
12.2K runs

Commercial-ready text-to-image model trained entirely on licensed data. With only 4B parameters, it provides exceptional aesthetics and text rendering. Evaluated to be on par with other leading models on the market.
Updated 1 month, 2 weeks ago
168.7K runs

An image generation foundation model in the Qwen series that achieves significant advances in complex text rendering.
Updated 2 months ago
1.6M runs

Very fast image generation and editing model. 4 steps distilled, sub-second inference for production and near real-time applications.
Updated 2 months, 1 week ago
8.3M runs

bytedance/seedream-4.5
Seedream 4.5: Upgraded ByteDance image model with stronger spatial understanding and world knowledge
Updated 3 months, 2 weeks ago
5.5M runs

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts
Updated 4 months, 1 week ago
10.6M runs

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language
Updated 4 months, 1 week ago
47.8M runs

FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.
Updated 4 months, 2 weeks ago
20.5M runs

State-of-the-art image generation with top of the line prompt following, visual quality, image detail and output diversity.
Updated 4 months, 2 weeks ago
14M runs

bytedance/seedream-3
A text-to-image model with support for native high-resolution (2K) image generation
Updated 4 months, 2 weeks ago
3.3M runs

prunaai/flux-fast
This is the fastest Flux endpoint in the world.
Updated 4 months, 2 weeks ago
39.8M runs

Artistic and high-quality visuals with improved prompt adherence, diversity, and definition
Updated 4 months, 2 weeks ago
250K runs

recraft-ai/recraft-v3
Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis
Updated 4 months, 2 weeks ago
8.1M runs

recraft-ai/recraft-v3-svg
Recraft V3 SVG (code-named red_panda) is a text-to-image model with the ability to generate high quality SVG images including logotypes, and icons. The model supports a wide list of styles.
Updated 4 months, 2 weeks ago
385.1K runs

ideogram-ai/ideogram-v3-turbo
Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles
Updated 4 months, 2 weeks ago
8.3M runs

ideogram-ai/ideogram-v2a-turbo
Like Ideogram v2 turbo, but now faster and cheaper
Updated 4 months, 2 weeks ago
386.3K runs

ideogram-ai/ideogram-v2
An excellent image model with state of the art inpainting, prompt comprehension and text rendering
Updated 4 months, 2 weeks ago
2.7M runs

ideogram-ai/ideogram-v3-quality
The highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles
Updated 4 months, 2 weeks ago
2.1M runs

ideogram-ai/ideogram-v2a
Like Ideogram v2, but faster and cheaper
Updated 4 months, 2 weeks ago
2M runs

ideogram-ai/ideogram-v3-balanced
Balance speed, quality and cost. Ideogram v3 creates images with stunning realism, creative designs, and consistent styles
Updated 4 months, 2 weeks ago
420.3K runs

ideogram-ai/ideogram-v2-turbo
A fast image model with state of the art inpainting, prompt comprehension and text rendering.
Updated 4 months, 2 weeks ago
2.9M runs

stability-ai/stable-diffusion-3.5-medium
2.5 billion parameter image model with improved MMDiT-X architecture
Updated 4 months, 2 weeks ago
108.4K runs

stability-ai/stable-diffusion-3.5-large
A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, thanks to Query-Key Normalization.
Updated 4 months, 2 weeks ago
2M runs

stability-ai/stable-diffusion-3.5-large-turbo
A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, with a focus on fewer inference steps
Updated 4 months, 2 weeks ago
998.3K runs

Minimax's first image model, with character reference support
Updated 4 months, 2 weeks ago
2.7M runs

luma/photon-flash
Accelerated variant of Photon prioritizing speed while maintaining quality
Updated 4 months, 2 weeks ago
404.4K runs

Run any ComfyUI workflow. Guide: https://github.com/replicate/cog-comfyui
Updated 4 months, 2 weeks ago
7.9M runs

tencent/hunyuan-image-3
A powerful native multimodal model for image generation (PrunaAI squeezed)
Updated 5 months, 2 weeks ago
63.4K runs

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
Updated 8 months ago
1.1M runs

prunaai/wan-2.2-image
This model generates beautiful cinematic 2 megapixel images in 3-4 seconds and is derived from the Wan 2.2 model through optimisation techniques from the pruna package
Updated 8 months, 1 week ago
1.1M runs

prunaai/hidream-l1-full
This is an optimised version of the hidream-full model using the pruna ai optimisation toolkit!
Updated 8 months, 1 week ago
34K runs

prunaai/hidream-l1-dev
This is an optimised version of the hidream-l1-dev model using the pruna ai optimisation toolkit!
Updated 8 months, 1 week ago
49.2K runs

A version of flux-dev, a text to image model, that supports fast fine-tuned lora inference
Updated 8 months, 3 weeks ago
5.8M runs

A 12 billion parameter rectified flow transformer capable of generating images from text descriptions
Updated 8 months, 3 weeks ago
43.5M runs

The fastest image generation model tailored for local development and personal use
Updated 8 months, 3 weeks ago
637.4M runs

prunaai/sdxl-lightning
This is the fastest sdxl-lightning endpoint in the world on A100. Contact us for more at pruna.ai
Updated 9 months, 2 weeks ago
4.2K runs

bytedance/sdxl-lightning-4step
SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps
Updated 1 year ago
1B runs

A fast image model with wide artistic range and resolutions up to 4096x4096
Updated 1 year, 3 months ago
244.5K runs

luma/photon
High-quality image generation model optimized for creative professional workflows and ultra-high fidelity outputs
Updated 1 year, 3 months ago
3.2M runs

stability-ai/sdxl
A text-to-image generative AI model that creates beautiful images
Updated 1 year, 10 months ago
84.3M runs

fofr/sticker-maker
Make stickers with AI. Generates graphics with transparent backgrounds.
Updated 1 year, 11 months ago
2M runs

ai-forever/kandinsky-2
text2img model trained on LAION HighRes and fine-tuned on internal datasets
Updated 1 year, 11 months ago
6.2M runs

ai-forever/kandinsky-2.2
multilingual text2image latent diffusion model
Updated 1 year, 11 months ago
10.1M runs

playgroundai/playground-v2.5-1024px-aesthetic
Playground v2.5 is the state-of-the-art open-source model in aesthetic quality
Updated 2 years ago
3M runs

datacte/proteus-v0.3
ProteusV0.3: The Anime Update
Updated 2 years, 1 month ago
5.6M runs

Last update: now supports img2img. SDXL Canny controlnet with LoRA support.
Updated 2 years, 1 month ago
1M runs

datacte/proteus-v0.2
Proteus v0.2 shows subtle yet significant improvements over Version 0.1. It demonstrates enhanced prompt understanding that surpasses MJ6, while also approaching its stylistic capabilities.
Updated 2 years, 2 months ago
12M runs

adirik/realvisxl-v3.0-turbo
Photorealism with RealVisXL V3.0 Turbo based on SDXL
Updated 2 years, 2 months ago
615.5K runs

fofr/latent-consistency-model
Super-fast, 0.6s per image. LCM with img2img, large batching and canny controlnet
Updated 2 years, 2 months ago
1.5M runs

RealVisXL V3 with multi-controlnet, lora loading, img2img, inpainting
Updated 2 years, 2 months ago
2.1M runs

lucataco/open-dalle-v1.1
A unique fusion that showcases exceptional prompt adherence and semantic understanding. It seems to be a step above base SDXL and a step closer to DALLE-3 in terms of prompt comprehension
Updated 2 years, 2 months ago
132.7K runs

fofr/sdxl-multi-controlnet-lora
Multi-controlnet, lora loading, img2img, inpainting
Updated 2 years, 3 months ago
219.1K runs

lucataco/dreamshaper-xl-turbo
DreamShaper is a general purpose SD model that aims at doing everything well: photos, art, anime, manga. It's designed to match Midjourney and DALL-E.
Updated 2 years, 3 months ago
230.2K runs

lucataco/ssd-1b
Segmind Stable Diffusion Model (SSD-1B) is a distilled 50% smaller version of SDXL, offering a 60% speedup while maintaining high-quality text-to-image generation capabilities
Updated 2 years, 4 months ago
1M runs

fofr/sdxl-emoji
An SDXL fine-tune based on Apple Emojis
Updated 2 years, 6 months ago
11.9M runs

lucataco/realistic-vision-v5.1
Implementation of Realistic Vision v5.1 with VAE
Updated 2 years, 7 months ago
4.3M runs

stability-ai/stable-diffusion
A latent text-to-image diffusion model capable of generating photo-realistic images given any text input
Updated 2 years, 8 months ago
111M runs

jagilley/controlnet-scribble
Generate detailed images from scribbled drawings
Updated 3 years, 1 month ago
38.3M runs

tstramer/material-diffusion
Stable diffusion fork for generating tileable outputs using v1.5 model
Updated 3 years, 4 months ago
2.4M runs