Remove or replace objects in your images.
Change the style of your images (e.g. "make this Studio Ghibli style")
Add text that looks natural. Generate images with text that matches specific fonts and styles.
Change specific parts of an image while keeping its structure. Use depth maps or edge detection to control what changes.
Create variations of your images. Keep what works while exploring new possibilities.
Need an in-depth exploration of all the latest image editing models? Check out this blog post
GPT Image 1.5 is one of the strongest options for editing photos of people while keeping them looking like themselves. It preserves facial identity, body shape, and pose when you change clothing, backgrounds, or lighting. Great for virtual try-ons, headshot retouching, and placing people into new scenes. (Requires an OpenAI API key.)
FLUX.2 Max maintains character identity across large batches — use up to 8 reference images to keep the same face consistent across dozens of outputs. Useful for campaigns, storyboards, or fashion editorials where the same person needs to appear in different settings.
Seedream 4.5 produces cinematic, film-like portraits with refined lighting and shading. Strong spatial understanding means realistic proportions and natural body positioning.
Imagen 4 Ultra renders fine detail — skin texture, individual strands of hair, fabric weave — at a level that makes people look real rather than AI-generated.
Nano Banana and Nano Banana Pro handle style changes conversationally. Describe what you want ("turn this into a watercolor painting," "make me look like a Simpsons character") and they get it right. Nano Banana Pro supports up to 14 input images and 4K output.
Grok Imagine Image from xAI is especially strong at moody aesthetics — retro anime, cyberpunk, dramatic contrast, and emotionally resonant framing. It naturally creates subdued color palettes and cinematic lighting. Fast too, at around 4 seconds per image.
Seedream 4 supports a wide range of visual styles (watercolor, cyberpunk, architectural) and can apply them to existing images or generate from scratch.
FLUX.2 Max is the top pick for product work. It handles hex color codes for exact brand colors, generates product variations from multiple angles, and transforms phone photos into polished product shots. Multi-reference support means consistent branding across an entire catalog.
FLUX.2 Pro offers similar capabilities at a lower price — structured JSON prompting gives you precise control over camera angle, lighting, and composition. A good choice for high-volume product imagery.
Seedream 4 and Seedream 4.5 support batch outputs and multi-reference input, so you can generate multiple product variations in a single request. Fast inference makes them practical for large catalogs.
Nano Banana and Nano Banana Pro are among the best conversational image editors available. They follow natural-language instructions accurately and support multi-image input:
GPT Image 1.5 is great at targeted edits that preserve everything you didn't ask to change — swap an outfit, adjust lighting, or translate text in an image while keeping the layout intact.
For object removal specifically, Bria Eraser and Bria GenFill are designed for clean removal and visual continuity.
Ideogram v3 excels at:
Need faster results? Try Ideogram v3 Turbo. Or learn more about running Ideogram models with an API.
Nano Banana Pro also handles multilingual text rendering well, with clear typography in multiple languages and varied textures.
Grok Imagine Image renders readable text within images better than most — useful for posters, social media graphics, and designs that need clear lettering.
Nano Banana Pro accepts up to 14 input images and blends them into cohesive compositions. Combine product photos with lifestyle scenes, merge reference images, or layer elements from different sources.
FLUX.2 Max supports up to 8 reference images via API (10 in the Playground). Point to specific images by index to control which elements come from where — "use the face from image 1 and the background from image 3."
GPT Image 1.5 handles multi-image input for scene compositing and style blending. (Requires an OpenAI API key.)
Test different editing approaches in the playground. Compare models side by side to find what works best for your project.
Want to learn more about inpainting? Check out our guide →
Questions? Join us on Discord.
Featured models

High-quality image generation and editing with support for eight reference images
Updated 6Â days, 5Â hours ago
3.9M runs

Google's state of the art image generation and editing model 🍌🍌
Updated 2Â weeks, 1Â day ago
17.2M runs

Use this ultra version of Imagen 4 when quality matters more than speed and cost
Updated 1Â month ago
1.5M runs

SOTA image model from xAI
Updated 1Â month ago
81.6K runs

Google's latest image editing model in Gemini 2.5
Updated 1Â month ago
91.1M runs

openai/gpt-image-1.5OpenAI's latest image generation model with better instruction following and adherence to prompts
Updated 1Â month, 2Â weeks ago
5.1M runs

The highest fidelity image model from Black Forest Labs
Updated 2Â months, 3Â weeks ago
1M runs

bytedance/seedream-4.5Seedream 4.5: Upgraded Bytedance image model with stronger spatial understanding and world knowledge
Updated 3Â months, 1Â week ago
4.7M runs

bytedance/seedream-4Unified text-to-image generation and precise single-sentence editing at up to 4K resolution
Updated 3Â months, 2Â weeks ago
29.3M runs
Recommended Models
If you want quick edits to an image, google/nano-banana is a strong choice—it handles editing using simple instructions in text, and it supports multi-image input.
Another fast option is bytedance/seedream-4, which supports editing at higher resolution without huge compute time.
For reliable edits with good prompt following and identity preservation, black-forest-labs/flux-kontext-pro offers a solid middle ground.
If you’re working on a more premium workflow (typography, precise edits, full stylization), black-forest-labs/flux-kontext-max scales up quality and control.
If you need to remove or swap items (for example a sign, person, or piece of furniture), models like bria/eraser and bria/genfill are designed for clean object removal and visual continuity.
For broader edits—such as changing a background or adding new elements while retaining structure—black-forest-labs/flux-kontext-pro works well with directed prompts.
If your edit centers on changing style (for instance “make this image look like a Studio Ghibli painting”) or adding text that looks like it belongs, ideogram-ai/ideogram-v3 is excellent for natural typography and stylized inpainting.
For depth-aware or edge-preserving edits (e.g., changing pose or structure while keeping main subjects intact), black-forest-labs/flux-depth-pro or black-forest-labs/flux-canny-pro provide more control.
There are two main approaches:
You’ll typically get an edited image file (same resolution or slightly changed depending on settings) where the requested modifications have been applied.
Some models also output additional metadata or allow multi-image input (for example combining two photos or layering edits) depending on the version.
If the model is open source, you can clone the repo and run it locally using Cog or Docker.
To publish your own model, prepare a replicate.yaml file that defines inputs (image, mask, prompt) and outputs, then push it to your Replicate account for use on managed hardware.
Yes—many image-editing models support commercial use. Always check the License section on each model’s page to confirm.
Also ensure you have rights to the image you are editing, especially if you plan to publish or monetize the output.
Open a model’s page on Replicate, upload your image (and optional mask or reference image), and enter a prompt describing your edit (e.g., “change the car colour to red and add a banner”).
The model will return a modified image you can download. Some models support further options like preserving identity, controlling style strength, or combining multiple inputs.
Recommended Models

bytedance/seedream-5-liteSeedream 5.0 lite: image generation with built-in reasoning, example-based editing, and deep domain knowledge
Updated 2Â weeks, 2Â days ago
113.6K runs

Bria Expand expands images beyond their borders in high quality. Resizing the image by generating new pixels to expand to the desired aspect ratio. Trained exclusively on licensed data for safe and risk-free commercial use
Updated 3Â weeks, 1Â day ago
354.6K runs

FIBO-Edit brings the power of structured prompt generation to image editing
Updated 1Â month ago
255 runs

SOTA Open source model trained on licensed data, transforming intent into structured control for precise, high-quality AI image generation in enterprise and agentic workflows.
Updated 1Â month, 1Â week ago
12K runs

Bria Background Generation allows for efficient swapping of backgrounds in images via text prompts or reference image, delivering realistic and polished results. Trained exclusively on licensed data for safe and risk-free commercial use
Updated 1Â month, 1Â week ago
53.4K runs

SOTA Object removal, enables precise removal of unwanted objects from images while maintaining high-quality outputs. Trained exclusively on licensed data for safe and risk-free commercial use
Updated 1Â month, 1Â week ago
333.5K runs

Bria GenFill enables high-quality object addition or visual transformation. Trained exclusively on licensed data for safe and risk-free commercial use.
Updated 1Â month, 1Â week ago
16K runs

prunaai/p-image-editA sub 1 second 0.01$ multi-image editing model built for production use cases. For image generation, check out p-image here: https://replicate.com/prunaai/p-image
Updated 3Â months, 1Â week ago
19.1M runs

An experimental FLUX Kontext model that can combine two input images
Updated 4Â months ago
225.6K runs

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts
Updated 4Â months ago
10.5M runs

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language
Updated 4Â months ago
47.2M runs

Use FLUX Kontext to restore, fix scratches and damage, and colorize old photos
Updated 4Â months ago
1M runs

An experimental model with FLUX Kontext Pro that can combine two input images
Updated 4Â months ago
2.4M runs

FLUX Kontext max with list input for multiple images
Updated 4Â months ago
177.6K runs

Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, seamless results.
Updated 4Â months ago
3.8M runs

ideogram-ai/ideogram-v2An excellent image model with state of the art inpainting, prompt comprehension and text rendering
Updated 4Â months ago
2.7M runs

ideogram-ai/ideogram-v3-qualityThe highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles
Updated 4Â months ago
2.1M runs

ideogram-ai/ideogram-v2-turboA fast image model with state of the art inpainting, prompt comprehension and text rendering.
Updated 4Â months ago
2.9M runs

prunaai/flux-kontext-fastUltra fast flux kontext endpoint
Updated 4Â months ago
20.1M runs

The latest Qwen-Image’s iteration with improved multi-image editing, single-image consistency, and native support for ControlNet
Updated 5Â months, 2Â weeks ago
9.8M runs

openai/gpt-image-1A multimodal image generation model that creates high-quality images. You need to bring your own verified OpenAI key to use this model. Your OpenAI account will be charged for usage.
Updated 5Â months, 2Â weeks ago
1.6M runs

Edit images using a prompt. This model extends Qwen-Image’s unique text rendering capabilities to image editing tasks, enabling precise text editing
Updated 6Â months, 3Â weeks ago
1.7M runs

fofr/color-matcherColor match and white balance fixes for images
Updated 6Â months, 3Â weeks ago
220K runs

Open-weight version of FLUX.1 Kontext
Updated 8Â months, 2Â weeks ago
6.8M runs

lucataco/omnigen2OmniGen2: a powerful and efficient unified multimodal model
Updated 8Â months, 2Â weeks ago
2.5K runs

bytedance/bagel🥯ByteDance Seed's Bagel Unified multimodal AI that generates images, edits images, and understands images in one 7B parameter model🥯
Updated 9Â months, 2Â weeks ago
275K runs

zsxkib/step1x-edit✍️Step1X-Edit by stepfun-ai, Edit an image using text prompt📸
Updated 10Â months, 1Â week ago
15.8K runs

orpatashnik/styleclipText-Driven Manipulation of StyleGAN Imagery
Updated 3Â years, 4Â months ago
1.3M runs