Explore

Featured models

krea/krea-2-medium

Foundation image model from Krea, tuned for expressive illustration, anime, and painterly styles. Fast and consistent across artistic directions.

20.6K runs

Official

alibaba/happyhorse-1.0

Alibaba's Happy Horse 1.0 generates videos from text prompts or animates a single image into video. Supports 720p and 1080p, 3-15 second durations, and five aspect ratios.

30.2K runs

Official

openai/gpt-image-2

OpenAI's state-of-the-art image generation model. Create and edit images from text with strong instruction following, sharp text rendering, and detailed editing.

17.2M runs

Official

anthropic/claude-opus-4.7

Anthropic's most capable model with a step-change improvement in agentic coding, better vision, and stronger multi-step reasoning

222.2K runs

Official

google/gemini-3.1-flash-tts

Google's fast, expressive text-to-speech model with 30 voices and 70+ language support

363.9K runs

Official

minimax/music-2.6

Generate full-length songs or instrumentals from a text prompt, with optional auto-generated lyrics

23.8K runs

Official

bytedance/seedance-2.0

ByteDance's multimodal video generation model with native audio, multimodal reference inputs, and intelligent duration control.

1.2M runs

Official

prunaai/p-video-avatar

p-video-avatar is the fastest and cheapest avatar/lipsync video model on the market.

122.9K runs

Official

bytedance/seedream-5-lite

Seedream 5.0 lite: image generation with built-in reasoning, example-based editing, and deep domain knowledge

3.1M runs

Official

xai/grok-imagine-video

Generate videos using xAI's Grok Imagine Video model

1.4M runs

Official

black-forest-labs/flux-2-max

The highest fidelity image model from Black Forest Labs

3.8M runs

Official

google/nano-banana-2

Google's fast image generation model with conversational editing, multi-image fusion, and character consistency

15.6M runs

Official

Official models

Official models are always on, maintained, and have predictable pricing.

anthropic / claude-sonnet-5

Anthropic's most agentic Sonnet model, bringing frontier-level coding and tool use at Sonnet's speed and price

2.8K runs

Official

reve / reve-2.1

Generate and edit images from text and reference images with Reve 2.1

1.3K runs

Official

openai / gpt-5.6-luna

OpenAI's GPT-5.6 cost-optimized tier, built for fast, high-volume, latency-sensitive workloads.

16.3K runs

Official

openai / gpt-5.6-terra

OpenAI's GPT-5.6 balanced tier, tuned for everyday production work at roughly half the cost of the flagship.

4.9K runs

Official

openai / gpt-5.6-sol

OpenAI's GPT-5.6 flagship tier, built for complex professional work, coding, and deep multi-step reasoning.

11.5K runs

Official

bytedance / seedream-5-pro

ByteDance's flagship text-to-image and image editing model, generating sharp 1K and 2K images from text or up to 10 reference images

48.9K runs

Official

google / nano-banana-2-lite

Google's fastest image generation model — the lightweight, low-cost version of Nano Banana 2, for rapid creation and editing

221.1K runs

Official

qwen / qwen3-7-plus

Qwen3.7-Plus is Alibaba's cost-effective multimodal model with vision-language understanding, a 1 million token context window, and strong agentic coding and tool use.

19.9K runs

Official

bytedance / seedance-2.0-mini

A lower-cost variant of Seedance 2.0 for high-volume video generation with multimodal inputs and native audio.

31.8K runs

Official

alibaba / happyhorse-1.1

Alibaba's Happy Horse 1.1 generates videos from text, animates a single image, or builds a video from multiple reference images. Supports 720p and 1080p, 3-15 second durations, and five aspect ratios.

10.5K runs

Official

sourceful / riverflow-v2.5-pro

Top-quality agentic image model with multi-step reasoning, candidate scoring, and adjustable thinking effort

1.7K runs

Official

sourceful / riverflow-v2.5-fast

Speed-optimized variant of Riverflow 2.5 for production and latency-sensitive workflows

1.2K runs

Official

luma / ray-3.2

Luma's reasoning video model. Generates cinematic 5s or 10s video from text or images, with native HDR and EXR export for professional production pipelines.

2.2K runs

Official

anthropic / claude-fable-5

Claude Fable 5 from Anthropic: the next generation of intelligence for the hardest knowledge work and coding problems.

10.9K runs

Official

ideogram-ai / ideogram-v4-quality

The highest quality Ideogram v4 model. v4 creates images with stunning realism, creative designs, and consistent styles

19.3K runs

Official

ideogram-ai / ideogram-v4-balanced

Balance speed, quality and cost. Ideogram v4 creates images with stunning realism, creative designs, and consistent styles

6.3K runs

Official

runwayml / aleph-2

Edit one frame to update an entire video. Aleph 2.0 is Runway's in-context video editor: longer clips (up to 30s), multi-shot edits, and image-level precision via keyframe references.

1.8K runs

Official

xai / grok-imagine-video-1.5

Image-to-video with synchronized audio using xAI's Grok Imagine Video 1.5 preview model

139.7K runs

Official

krea / krea-2-large

Krea's flagship foundation image model. Larger and more flexible than Krea 2 Medium, with particular strength in photorealism and expressive artistic styles.

3.6K runs

Official

krea / krea-2-medium

Foundation image model from Krea, tuned for expressive illustration, anime, and painterly styles. Fast and consistent across artistic directions.

20.6K runs

Official

I want to…

Generate images

Use AI to generate images & photos with an API

Caption videos

Use AI to understand, describe, and caption videos with an API

Generate speech

Use AI for text-to-speech or to clone your voice via API

Generate images from a face

Use AI to generate images from a face with an API

Generate videos

Use AI to generate videos with an API

Upscale images with super resolution

Use AI to upscale and enhance images with an API

Generate music

Use AI to generate music with an API

Edit any image

Use AI to edit any image via API

Transcribe speech to text

Use AI to transcribe speech to text with an API

OCR to extract text from images

Use AI for optical character recognition (OCR) to extract text from images via API

Remove backgrounds

Use AI to remove backgrounds from images and videos with an API

FLUX family of models

FLUX AI models by Black Forest Labs: image generation & editing via API

Restore images

Use AI to restore images via API

Enhance videos

Use AI to upscale, restore, extend, and enhance videos with an API

Detect NSFW content

Detect NSFW content in images and text

Classify text

Classify text by sentiment, topic, intent, or safety

Speaker diarization

Identify speakers from audio and video inputs

Create realistic face swaps

Replace faces across images with natural-looking results.

Turn sketches into images

Transform rough sketches into polished visuals

Generate emojis

Generate custom emojis from text or images

Generate anime-style images and videos

Create anime-style characters, scenes, and animations

Generate videos from images

Use AI to generate videos from images with an API

Vision models

Chat with images — visual Q&A, analysis, and reasoning via API

Caption images

Use AI to generate captions and descriptions from images with an API

Edit your videos

Use AI to edit, restyle, extend, and remix videos with an API

WAN family of models

WAN family of models: open-source video, image, and audio generation

Create 3D content

Generate 3D objects, meshes, and textures from text or images with an API

Official models

Official models are always on, predictably priced, and have a stable API.

Large Language Models (LLMs)

Explore Large Language Models (LLMs) for chat, generation & NLP tasks via API

Try AI models for free

Try AI models for free: video generation, image generation, upscaling, and photo restoration

Lipsync videos

Use AI to generate lipsync videos with an API

Control image generation

Use AI to control image generation with an API

Embedding models

Embedding models for AI search and analysis

Object detection and segmentation

Use AI object detection and segmentation models to distinguish objects in images & videos

Flux fine-tunes

Flux fine-tunes: build and run custom AI image models via API

Kontext fine-tunes

Kontext fine-tunes: build custom AI image models with an API

Create songs with voice cloning

Create songs with voice cloning models via API

Media utilities

AI media utilities: auto-caption, watermark, frame extraction & more via API

Qwen-Image fine-tunes

Browse the range of Qwen-Image fine-tunes that the community has custom-trained on Replicate.

Latest models

luke100000 / mage-flow-edit

Mage-Flow is a compact 4B-scale generative stack for efficient text-to-image generation and instruction-based image editing

16 runs

luke100000 / mage-flow

Mage-Flow is a compact 4B-scale generative stack for efficient text-to-image generation and instruction-based image editing

40 runs

irulkenzei / stable-audio-open

Stable Audio Open 1.0 generates variable-length (up to 47 s) stereo audio at 44.1 kHz from text prompts. Its components include an autoencoder that compresses waveforms into a manageable sequence length and a T5-based text embedding for text conditioning.

119 runs

anthropic / claude-sonnet-5

Anthropic's most agentic Sonnet model, bringing frontier-level coding and tool use at Sonnet's speed and price

2.8K runs

Official

okgodoit-repos / mindprint-zimage-turbo

Mindprint seamless-tiling pattern generator: Z-Image-Turbo (6B) with toroidal-RoPE + circular VAE for pixel-wrap seamless repeating patterns (per-axis tile_x/tile_y).

62 runs

okgodoit-repos / mindprint-flux2-klein

Mindprint seamless-tiling pattern generator: FLUX.2-klein-4B with toroidal-RoPE + circular VAE for pixel-wrap seamless repeating patterns (per-axis tile_x/tile_y).

88 runs