Explore

Featured models

openai/gpt-5-structured

GPT-5 with support for structured outputs, web search and custom tools

26.4K runs

Official

qwen/qwen-image

An image generation foundation model in the Qwen series that achieves significant advances in complex text rendering.

288.7K runs

Official

google/nano-banana

Google's latest image editing model in Gemini 2.5

2.9M runs

Official

kwaivgi/kling-v2.1

Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)

944.9K runs

Official

minimax/hailuo-02

Hailuo 2 is a text-to-video and image-to-video model that can make 6s or 10s videos at 768p (standard) or 1080p (pro). It excels at real world physics.

77.7K runs

Official

deepseek-ai/deepseek-v3.1

Latest hybrid thinking model from Deepseek

2.1K runs

Official

pixverse/pixverse-v5

Create 5s-8s videos with enhanced character movement, visual effects, and exclusive 1080p-8s support. Optimized for anime characters and complex actions

9.3K runs

Official

qwen/qwen-image-lora-trainer

Fine-tunable Qwen Image model with exceptional composition abilities - train custom LoRAs for any style or subject

139 runs

Official

qwen/qwen-image-edit

Edit images using a prompt. This model extends Qwen-Image’s unique text rendering capabilities to image editing tasks, enabling precise text editing

196.2K runs

Official

wan-video/wan-2.2-t2v-fast

A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B text-to-video

61.1K runs

Official

prunaai/wan-2.2-image

This model generates beautiful cinematic 2 megapixel images in 3-4 seconds and is derived from the Wan 2.2 model through optimisation techniques from the pruna package

238.9K runs

Official

bytedance/seedream-3

A text-to-image model with support for native high-resolution (2K) image generation

1.4M runs

Official

Official models

Official models are always on, maintained, and have predictable pricing.

View all official models

google/imagen-4-fast

Use this fast version of Imagen 4 when speed and cost are more important than quality

633.6K runs

Official

kwaivgi/kling-v1.6-pro

Generate 5s and 10s videos in 1080p resolution

701.3K runs

Official

black-forest-labs/flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

20.9M runs

Official

runwayml/upscale-v1

Upscale videos by 4x, up to a maximum of 4k

3.6K runs

Official

black-forest-labs/flux-canny-pro

Professional edge-guided image generation. Control structure and composition using Canny edge detection

341.4K runs

Official

google/veo-3

Sound on: Google’s flagship Veo 3 text to video model, with audio

154.4K runs

Official

google/veo-3-fast

A faster and cheaper version of Google’s Veo 3 video model, with audio

41.1K runs

Official

ibm-granite/granite-3.3-8b-instruct

Granite-3.3-8B-Instruct is a 8-billion parameter 128K context length language model fine-tuned for improved reasoning and instruction-following capabilities.

1.2M runs

Official

kwaivgi/kling-lip-sync

Add lip-sync to any video with an audio file or text

11.6K runs

Official

openai/gpt-5-structured

GPT-5 with support for structured outputs, web search and custom tools

26.4K runs

Official

qwen/qwen-image

An image generation foundation model in the Qwen series that achieves significant advances in complex text rendering.

288.7K runs

Official

bytedance/seedance-1-pro

A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution

402.2K runs

Official

bytedance/seedance-1-lite

A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution

710.9K runs

Official

google/gemini-2.5-flash-image

Google's latest image generation model in Gemini 2.5

25.2K runs

Official

google/nano-banana

Google's latest image editing model in Gemini 2.5

2.9M runs

Official

kwaivgi/kling-v2.1-master

A premium version of Kling v2.1 with superb dynamics and prompt adherence. Generate 1080p 5s and 10s videos from text or an image

39.8K runs

Official

kwaivgi/kling-v2.1

Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)

944.9K runs

Official

kwaivgi/kling-v1.5-pro

Generate 5s and 10s videos in 1080p resolution at 30fps

992 runs

Official

kwaivgi/kling-v1.5-standard

Generate 5s and 10s videos in 720p resolution at 30fps

597 runs

Official

kwaivgi/kling-v1.6-standard

Generate 5s and 10s videos in 720p resolution at 30fps

1.2M runs

Official

I want to…

Generate images

Use AI To Generate Images & Photos with an API

Caption videos

Use AI To Caption Videos with an API

Generate speech

Convert text to speech

Use a face to make images

Make realistic images of people instantly

Generate videos

Use AI To Generate Videos with an API

Upscale images

Upscaling models that create high-quality images from low-quality images

Generate music

Use AI To Generate Music with an API

Edit images

Use AI To Edit Any Image with an API

Transcribe speech

Models that convert speech to text

Extract text from images

Optical character recognition (OCR) and text extraction

Remove backgrounds

Models that remove backgrounds from images and videos

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Upscaling models that create high-quality video from low-quality videos

Use Qwen-Image fine-tunes

Browse the diverse range of qwen-image fine-tunes the community has custom-trained on Replicate

Use LLMs

Models that can understand and generate text

Use handy tools

Toolbelt-type models for videos and images.

Edit Videos

Tools for editing videos.

Caption images

Use AI To Caption Images with an API

Videos from images

Use AI To Generate Videos from images with an API

Make videos with Wan

Generate videos with Wan, the fastest and highest quality open-source video generation model.

Use Kontext fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Chat with images

Ask language models about images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Sing with voices

Voice-to-voice cloning and musical prosody

Get embeddings

Models that generate embeddings from inputs

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Use official models

Official models are always on, maintained, and have predictable pricing.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Latest models

google/imagen-4-fast

Use this fast version of Imagen 4 when speed and cost are more important than quality

633.6K runs

Official

kwaivgi/kling-v1.6-pro

Generate 5s and 10s videos in 1080p resolution

701.3K runs

Official

black-forest-labs/flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

20.9M runs

Official

runwayml/upscale-v1

Upscale videos by 4x, up to a maximum of 4k

3.6K runs

Official

tencent/hunyuanvideo-foley

(Research & Non-commercial use only) Text-Video-to-Audio Synthesis: Generate realistic audio from video and text descriptions

6 runs

black-forest-labs/flux-canny-pro

Professional edge-guided image generation. Control structure and composition using Canny edge detection

341.4K runs

Official

google/veo-3

Sound on: Google’s flagship Veo 3 text to video model, with audio

154.4K runs

Official

google/veo-3-fast

A faster and cheaper version of Google’s Veo 3 video model, with audio

41.1K runs

Official

andreasjansson/heineken-commercial

https://x.com/grok/status/1963522967511650331

13 runs

andreasjansson/landscaping

Make your garden pretty

8 runs

kojott/content-moderation-vision

AI-powered content moderation for images using MiniCPM-V-2.6 - analyzes visual content and returns structured safety scores with detailed classifications

30 runs

fofr/face-swap-with-ideogram

Use ideogram-character to face-swap someone into a target image

6.2K runs

Explore

Fine-tune Qwen-Image

Harness Qwen-Image's exceptional composition abilities to produce your own custom model.

Announcing Replicate's remote MCP server

How to prompt Veo 3 with images

Open source video is back

Featured models

Official models

google/imagen-4-fast

kwaivgi/kling-v1.6-pro

black-forest-labs/flux-kontext-pro

runwayml/upscale-v1

black-forest-labs/flux-canny-pro

google/veo-3

google/veo-3-fast

ibm-granite/granite-3.3-8b-instruct

kwaivgi/kling-lip-sync

openai/gpt-5-structured

qwen/qwen-image

bytedance/seedance-1-pro

bytedance/seedance-1-lite

google/gemini-2.5-flash-image

google/nano-banana

kwaivgi/kling-v2.1-master

kwaivgi/kling-v2.1

kwaivgi/kling-v1.5-pro

kwaivgi/kling-v1.5-standard

kwaivgi/kling-v1.6-standard

I want to…

Latest models