Explore

Official models

Official models are always on, maintained, and have predictable pricing.

View all official models
gemini-3.1-flash-tts

google / gemini-3.1-flash-tts

Google's fast, expressive text-to-speech model with 30 voices and 70+ language support

995 runs
Official
music-cover

minimax / music-cover

Reimagine any song in a different style — change voice, instruments, genre, and arrangement while keeping the original melody

209 runs
Official
music-2.6

minimax / music-2.6

Generate full-length songs or instrumentals from a text prompt, with optional auto-generated lyrics

771 runs
Official

decart / lucy-edit-2

Edit and transform videos with text prompts and reference images. Style transfers, object replacement, character transformation, and more.

23 runs
Official
layerize

ideogram-ai / layerize

Take a flat graphic, remove text, and get structured text layers back for editing and recomposing

1K runs
Official

bytedance / seedance-2.0-fast

A faster variant of Seedance 2.0 for quicker video generation with multimodal inputs and native audio.

9.6K runs
Official

bytedance / seedance-2.0

ByteDance's multimodal video generation model with native audio, multimodal reference inputs, and intelligent duration control.

40.4K runs
Official
lyria-3-pro

google / lyria-3-pro

Generate full-length songs up to 3 minutes from text prompts or images with Lyria 3 Pro, Google's most capable music generation model

3.6K runs
Official
lyria-3

google / lyria-3

Generate 30-second music clips from text prompts or images with Lyria 3, Google's music generation model

2.2K runs
Official
wan-2.7-image-pro

wan-video / wan-2.7-image-pro

Generate and edit high-quality images with Alibaba's Wan 2.7 Pro with 4K output, thinking mode, text-to-image, multi-image editing, and image set generation

17K runs
Official
wan-2.7-image

wan-video / wan-2.7-image

Generate and edit images with Alibaba's Wan 2.7

5.6K runs
Official
veo-3.1-lite

google / veo-3.1-lite

Google's cost-efficient video generation model with native audio, optimized for high-volume applications

9.7K runs
Official

wan-video / wan-2.7-videoedit

Edit videos with natural language instructions using Alibaba's Wan 2.7 VideoEdit model

1.2K runs
Official
wan-2.7-r2v

wan-video / wan-2.7-r2v

Generate videos from reference images or clips while preserving subject identity using Alibaba's Wan 2.7 reference-to-video model

647 runs
Official
wan-2.7-i2v

wan-video / wan-2.7-i2v

Generate videos from images, with support for first-and-last-frame control, clip continuation, and audio synchronization using Alibaba's Wan 2.7 model

4.9K runs
Official

wan-video / wan-2.7-t2v

Generate videos with audio from text prompts using Alibaba's Wan 2.7 model. 1080p, up to 15 seconds, with audio synchronization.

978 runs
Official

xai / grok-imagine-r2v

Generate videos guided by reference images using xAI's Grok Imagine Video model

5.9K runs
Official

xai / grok-imagine-video-extension

Extend videos with xAI's Grok Imagine Video model. Provide a source video and describe what happens next.

1.7K runs
Official
tts-1.5-mini

inworld / tts-1.5-mini

Ultra-fast, cost-efficient text-to-speech with ~120ms latency and 15-language support

13.5K runs
Official
tts-1.5-max

inworld / tts-1.5-max

Highest-quality text-to-speech with <200ms latency, emotion control, and 15-language support

46.7K runs
Official

OCR to extract text from images

Use AI For Optical Character Recognition (OCR) to extract text from images via API

Classify text

Classify text by sentiment, topic, intent, or safety

Create realistic face swaps

Replace faces across images with natural-looking results.

Vision models

Chat with images — visual Q&A, analysis, and reasoning via API

Caption Images

Use AI to generate captions and descriptions from images with an API

WAN family of models

WAN family of models: open-source video, image, and audio generation

Create 3D content

Generate 3D objects, meshes, and textures from text or images with an API

Official models

Official models are always on, predictably priced, and have a stable API.

Large Language Models (LLMs)

Explore Large Language Models (LLMs) for chat, generation & NLP tasks via API

Try AI models for free

Try AI Models for free: video generation, image generation, upscaling, and photo restoration

Object detection and segmentation

Use AI object detection and segmentation models to distinguish objects in images & videos

Flux fine-tunes

Flux fine-tunes: build and run custom AI image models via API

Media utilities

AI media utilities: auto-caption, watermark, frame extraction & more via API

Qwen-Image fine-tunes

Browse the diverse range of qwen-image fine-tunes the community has custom-trained on Replicate.

Latest models