Explore

Featured models

resemble-ai/chatterbox-turbo

The fastest open source TTS model without sacrificing quality.

444 runs

Official

openai/gpt-5.2

The best model for coding and agentic tasks across industries

118.6K runs

Official

bytedance/seedream-4.5

Seedream 4.5: Upgraded Bytedance image model with stronger spatial understanding and world knowledge

176.2K runs

Official

prunaai/z-image-turbo

Z-Image Turbo is a super-fast 6B-parameter text-to-image model developed by Tongyi-MAI.

627.5K runs

Official

lightricks/ltx-2-retake

Take any shot and edit specific sections. Rephrase, change the action, camera angles and more

666 runs

Official

google/gemini-3-pro

Google's most advanced reasoning Gemini model

49.4K runs

Official

black-forest-labs/flux-2-pro

High-quality image generation and editing with support for eight reference images

428.2K runs

Official

google/nano-banana-pro

Google's state-of-the-art image generation and editing model 🍌🍌

4.6M runs

Official

prunaai/p-image-edit

A sub-1-second, $0.01 multi-image editing model built for production use cases. For image generation, see p-image: https://replicate.com/prunaai/p-image

707.3K runs

Official

google/veo-3.1

New and improved version of Veo 3, with higher-fidelity video, context-aware audio, reference image and last frame support

188.6K runs

Official

openai/sora-2

OpenAI's flagship video generation model with synced audio

140.3K runs

Official

philz1337x/crystal-upscaler

High-precision image upscaler optimized for portraits, faces and products. One of the upscale modes powered by Clarity AI. X: https://x.com/philz1337x

255.6K runs

Official

Official models

Official models are always on, maintained, and have predictable pricing.

wan-video / wan-2.6-i2v

Alibaba Wan 2.6 image to video generation model

45 runs

Official

wan-video / wan-2.6-t2v

Alibaba Wan 2.6 text to video generation model

43 runs

Official

resemble-ai / chatterbox-turbo

The fastest open source TTS model without sacrificing quality.

444 runs

Official

openai / gpt-5.2

The best model for coding and agentic tasks across industries

118.6K runs

Official

sync / react-1

Realistic lipsync with refined human emotion capabilities

75 runs

Official

veed / fabric-1.0

VEED Fabric 1.0 is an image-to-video API that turns any image into a talking video

227 runs

Official

bytedance / seedream-4.5

Seedream 4.5: Upgraded Bytedance image model with stronger spatial understanding and world knowledge

176.2K runs

Official

prunaai / z-image-turbo

Z-Image Turbo is a super-fast 6B-parameter text-to-image model developed by Tongyi-MAI.

627.5K runs

Official

black-forest-labs / flux-2-flex

Max-quality image generation and editing with support for ten reference images

43.4K runs

Official

black-forest-labs / flux-2-dev

Quality image generation and editing with support for reference images

84.9K runs

Official

lightricks / ltx-2-retake

Take any shot and edit specific sections. Rephrase, change the action, camera angles and more

666 runs

Official

google / gemini-3-pro

Google's most advanced reasoning Gemini model

49.4K runs

Official

hyper3d / rodin

Generate complex 3D models from images with Rodin Gen-2

1.1K runs

Official

black-forest-labs / flux-2-pro

High-quality image generation and editing with support for eight reference images

428.2K runs

Official

openai / gpt-5.1

The best model for coding and agentic tasks with configurable reasoning effort.

173.2K runs

Official

qwen-edit-apps / qwen-image-edit-plus-lora-fusion

Fusion – Product/object blending that fixes perspective and lighting so the subject melts into a new background via the Fusion LoRA.

578 runs

Official

qwen-edit-apps / qwen-image-edit-plus-lora-relight

Relight – Soft, curtain-filtered relighting that repaints the scene with golden-hour or moody tones using the Relight LoRA.

920 runs

Official

qwen-edit-apps / qwen-image-edit-plus-lora-upscale

Upscale – Detail-loving upscale/restore pass that sharpens textures and color fidelity with the Upscale LoRA.

460 runs

Official

qwen-edit-apps / qwen-image-edit-plus-lora-next-scene

Next Scene – “Next beat” cinematic edits that keep subject identity while steering to the next camera move via the Next Scene LoRA

4.1K runs

Official

qwen-edit-apps / qwen-image-edit-plus-lora-skin

Skin – Natural beauty retouch that enhances pores and tonal variation (no plastic skin) via the Skin LoRA.

918 runs

Official

I want to…

Generate images

Use AI to generate images & photos with an API

Generate speech

Use AI for text-to-speech or to clone your voice via API

Generate images from a face

Use AI to generate images from a face with an API

Upscale images with super resolution

Use AI to upscale images with super resolution with an API

Transcribe speech to text

Use AI to transcribe speech to text via API

OCR to extract text from images

Use AI for optical character recognition (OCR) to extract text from images via API

Remove backgrounds

Use AI to remove backgrounds from images and videos with an API

FLUX family of models

FLUX AI models: advanced image generation & editing via API

Enhance videos

Use AI to enhance videos via API

Classify text

Classify text by sentiment, topic, intent, or safety

Speaker diarization

Identify speakers from audio and video inputs

Create realistic face swaps

Replace faces across images with natural-looking results.

Turn sketches into images

Transform rough sketches into polished visuals

Generate emojis

Generate custom emojis from text or images

Generate anime-style images and videos

Create anime-style characters, scenes, and animations

Generate videos from images

Use AI to generate videos from images with an API

Lipsync videos

Use AI to generate lipsync videos with an API

Vision models

Chat with images for understanding, captioning & detection via API

Large Language Models (LLMs)

Explore Large Language Models (LLMs) for chat, generation & NLP tasks via API

Try AI models for free

Try AI models for free: video generation, image generation, upscaling, and photo restoration

Control image generation

Use AI to control image generation with an API

Embedding models

Embedding models for AI search and analysis

Object detection and segmentation

Use AI object detection and segmentation models to distinguish objects in images & videos

Official AI models

Official AI models: Always available, stable, and predictably priced

Flux fine-tunes

Flux fine-tunes: build and run custom AI image models via API

Kontext fine-tunes

Kontext fine-tunes: Build custom AI image models with an API

Create songs with voice cloning

Create songs with voice cloning models via API

Media utilities

AI media utilities: auto-caption, watermark, frame extraction & more via API

Qwen-Image fine-tunes

Browse the diverse range of qwen-image fine-tunes the community has custom-trained on Replicate.

WAN family of models

WAN family of models: powerful image-to-video & text-to-video models

Latest models

nvidia / nemotron-3-nano-30b-a3b

Nemotron-3-Nano-30B-A3B is a large language model (LLM) trained from scratch by NVIDIA

216 runs

wan-video / wan-2.6-i2v

Alibaba Wan 2.6 image to video generation model

45 runs

Official

wan-video / wan-2.6-t2v

Alibaba Wan 2.6 text to video generation model

43 runs

Official

vufinder / depth-anything-v3-metric

Monocular metric depth estimation

1 run

souterdelilah-bit / delilahsouter

4 runs

resemble-ai / chatterbox-turbo

The fastest open source TTS model without sacrificing quality.

444 runs

Official

nightowlstudio-11 / clipforge

Add Text & Music to Videos

3.5K runs

jasonod888 / moge2

Outputs Depth, Normal & Point Cloud for a Given Input Image

1 run

leadssolution2 / aiavatar

7 runs

vufinder / depth-anything-v3-mono

Monocular relative depth estimation

14 runs

prunaai / qwen-image-fast

Qwen-Image optimized by Pruna AI. Generates high-fidelity 1.5 MP images in 1 s. The model will be priced at $0.01 in January.

1.3K runs

tattzy25 / famous-flux

A custom Flux 1 Dev LoRA trained on ~50 diverse images blending portrait photography with automotive aesthetics. Use the trigger word FAMOSOFLUXO to activate the style. Perfect for creating unique fusion imagery combining human subjects with car culture e

39 runs

FLUX.2 [pro]

Black Forest Labs' most advanced image generation model yet.

Run Isaac 0.1 on Replicate

Run FLUX.2 on Replicate

How to prompt Nano Banana Pro
