Explore

Official models

Official models are always on, maintained, and have predictable pricing.

View all official models
tts-1.5-mini

inworld / tts-1.5-mini

Ultra-fast, cost-efficient text-to-speech with ~120ms latency and 15-language support

268 runs
Official
tts-1.5-max

inworld / tts-1.5-max

Highest-quality text-to-speech with <200ms latency, emotion control, and 15-language support

543 runs
Official

lightricks / ltx-2.3-pro

High-fidelity video generation with portrait support, audio-to-video, retake, and extend. Text, image, and audio-driven creation up to 4K at 50 FPS.

1.5K runs
Official
gpt-5.4

openai / gpt-5.4

OpenAI's most capable frontier model for complex professional work, coding, and multi-step reasoning.

3K runs
Official

lightricks / ltx-2.3-fast

Lightning-fast video generation with portrait support, camera controls, and synchronized audio. Up to 20 seconds at 1080p, 4K at 50 FPS.

2.2K runs
Official

kwaivgi / kling-v3-motion-control

Kling 3.0 motion control: transfer motion from a reference video to any character image with improved consistency and quality.

7.9K runs
Official

vidu / q3-turbo

Fast video generation with text-to-video, image-to-video, and start-end-to-video modes. Up to 16 seconds at 1080p with synchronized audio.

121 runs
Official
qwen-image-2-pro

qwen / qwen-image-2-pro

The pro version of Qwen Image 2 from Alibaba's Qwen team. Enhanced text rendering, realism, and semantic adherence for high-quality image generation and editing.

2.3K runs
Official
qwen-image-2

qwen / qwen-image-2

A next-generation image generation and editing model from Alibaba's Qwen team. Supports text-to-image and image editing with strong text rendering, especially for Chinese.

2.1K runs
Official

heygen / avatar-iv

Create realistic talking avatar videos from text with HeyGen's Avatar IV engine

86 runs
Official
music-2.5

minimax / music-2.5

Generate full-length songs with vocals, lyrics, and rich instrumentation from a text prompt

707 runs
Official

vidu / q3-pro

High-fidelity video generation with text-to-video, image-to-video, and start-end-to-video modes. Up to 16 seconds at 1080p with synchronized audio.

463 runs
Official

heygen / video-agent

Turn a text prompt into a complete, polished video with AI-generated script, avatar presenter, voiceover, visuals, and editing.

136 runs
Official
seedream-5-lite

bytedance / seedream-5-lite

Seedream 5.0 lite: image generation with built-in reasoning, example-based editing, and deep domain knowledge

115.1K runs
Official
gemini-3.1-pro

google / gemini-3.1-pro

Google's most intelligent model, with improved reasoning and a new medium thinking level

42.4K runs
Official

runwayml / gen-4.5

State-of-the-art video motion quality, prompt adherence and visual fidelity

26.5K runs
Official
recraft-v4-pro-svg

recraft-ai / recraft-v4-pro-svg

Generate detailed SVG vector graphics from text prompts. Recraft V4 Pro's design taste with more geometric detail and finer paths — clean layers, editable output, and scalable to any size.

2.3K runs
Official
recraft-v4-svg

recraft-ai / recraft-v4-svg

Generate production-ready SVG vector images from text prompts. Recraft V4's design taste applied to vector output — clean geometry, structured layers, and editable paths.

5.3K runs
Official
recraft-v4-pro

recraft-ai / recraft-v4-pro

Recraft's latest image generation model at ~2048px resolution. Same design taste and prompt accuracy as V4, with higher resolution for print-ready and large-scale work.

4.3K runs
Official
recraft-v4

recraft-ai / recraft-v4

Recraft's latest image generation model, built around design taste. Strong prompt accuracy, art-directed composition, and integrated text rendering. Fast and cost-efficient at standard resolution.

25.3K runs
Official

OCR to extract text from images

Use AI For Optical Character Recognition (OCR) to extract text from images via API

Classify text

Classify text by sentiment, topic, intent, or safety

Create realistic face swaps

Replace faces across images with natural-looking results.

Official models

Official models are always on, predictably priced, and have a stable API.

Large Language Models (LLMs)

Explore Large Language Models (LLMs) for chat, generation & NLP tasks via API

Try AI models for free

Try AI Models for free: video generation, image generation, upscaling, and photo restoration

Vision models

Chat with images for understanding, captioning & detection via API

Object detection and segmentation

Use AI object detection and segmentation models to distinguish objects in images & videos

Flux fine-tunes

Flux fine-tunes: build and run custom AI image models via API

Media utilities

AI media utilities: auto-caption, watermark, frame extraction & more via API

Qwen-Image fine-tunes

Browse the diverse range of qwen-image fine-tunes the community has custom-trained on Replicate.

WAN family of models

WAN family of models: powerful image-to-video & text-to-video models

Latest models