Explore

Official models

Official models are always on, maintained, and predictably priced.


ideogram-ai / layerize

Take a flat graphic, remove text, and get structured text layers back for editing and recomposing

75 runs
Official

bytedance / seedance-2.0-fast

A faster variant of Seedance 2.0 for quicker video generation with multimodal inputs and native audio.

544 runs
Official

bytedance / seedance-2.0

ByteDance's multimodal video generation model with native audio, multimodal reference inputs, and intelligent duration control.

5.6K runs
Official

google / lyria-3-pro

Generate full-length songs up to 3 minutes from text prompts or images with Lyria 3 Pro, Google's most capable music generation model

1.9K runs
Official

google / lyria-3

Generate 30-second music clips from text prompts or images with Lyria 3, Google's music generation model

1.6K runs
Official

wan-video / wan-2.7-image-pro

Generate and edit high-quality images with Alibaba's Wan 2.7 Pro with 4K output, thinking mode, text-to-image, multi-image editing, and image set generation

10.2K runs
Official

wan-video / wan-2.7-image

Generate and edit images with Alibaba's Wan 2.7

3.1K runs
Official

google / veo-3.1-lite

Google's cost-efficient video generation model with native audio, optimized for high-volume applications

5.9K runs
Official

wan-video / wan-2.7-videoedit

Edit videos with natural language instructions using Alibaba's Wan 2.7 VideoEdit model

784 runs
Official

wan-video / wan-2.7-r2v

Generate videos from reference images or clips while preserving subject identity using Alibaba's Wan 2.7 reference-to-video model

444 runs
Official

wan-video / wan-2.7-i2v

Generate videos from images, with support for first-and-last-frame control, clip continuation, and audio synchronization using Alibaba's Wan 2.7 model

2.4K runs
Official

wan-video / wan-2.7-t2v

Generate videos with audio from text prompts using Alibaba's Wan 2.7 model. 1080p, up to 15 seconds, with audio synchronization.

644 runs
Official

xai / grok-imagine-r2v

Generate videos guided by reference images using xAI's Grok Imagine Video model

5K runs
Official

xai / grok-imagine-video-extension

Extend videos with xAI's Grok Imagine Video model. Provide a source video and describe what happens next.

1.3K runs
Official

inworld / tts-1.5-mini

Ultra-fast, cost-efficient text-to-speech with ~120ms latency and 15-language support

9.1K runs
Official

inworld / tts-1.5-max

Highest-quality text-to-speech with <200ms latency, emotion control, and 15-language support

34.1K runs
Official

prunaai / p-image-upscale

Very efficient image upscaler supporting outputs up to 8 MP. Upscales images to 4 MP in under one second.

6.2K runs
Official

lightricks / ltx-2.3-pro

High-fidelity video generation with portrait support, audio-to-video, retake, and extend. Text, image, and audio-driven creation up to 4K at 50 FPS.

7.3K runs
Official

openai / gpt-5.4

OpenAI's most capable frontier model for complex professional work, coding, and multi-step reasoning.

27.5K runs
Official

lightricks / ltx-2.3-fast

Lightning-fast video generation with portrait support, camera controls, and synchronized audio. Up to 20 seconds at 1080p, 4K at 50 FPS.

8.7K runs
Official

OCR to extract text from images

Use AI for optical character recognition (OCR) to extract text from images via API

Classify text

Classify text by sentiment, topic, intent, or safety

Create realistic face swaps

Replace faces across images with natural-looking results.

Official models

Official models are always on, predictably priced, and have a stable API.

Large Language Models (LLMs)

Explore Large Language Models (LLMs) for chat, generation & NLP tasks via API

Try AI models for free

Try AI models for free: video generation, image generation, upscaling, and photo restoration

Vision models

Chat with images for understanding, captioning & detection via API

Object detection and segmentation

Use AI object detection and segmentation models to distinguish objects in images & videos

Flux fine-tunes

Flux fine-tunes: build and run custom AI image models via API

Media utilities

AI media utilities: auto-caption, watermark, frame extraction & more via API

Qwen-Image fine-tunes

Browse the diverse range of Qwen-Image fine-tunes the community has custom-trained on Replicate.

WAN family of models

WAN family of models: powerful image-to-video & text-to-video models

Latest models