Explore

Official models

Official models are always on, maintained, and have predictable pricing.

View all official models
recraft-v4.1-utility-pro

recraft-ai / recraft-v4.1-utility-pro

A faster, lighter Recraft image generation model at ~2048px resolution, optimized for high-volume production. Design taste and prompt accuracy at high resolution with better throughput.

6 runs
Official
recraft-v4.1-utility

recraft-ai / recraft-v4.1-utility

A faster, lighter Recraft image generation model optimized for high-volume and production pipelines. Same design taste as V4.1, built for speed and throughput.

38 runs
Official
recraft-v4.1-pro-svg

recraft-ai / recraft-v4.1-pro-svg

Generate detailed SVG vector graphics from text prompts. Recraft V4.1 Pro's design taste with more geometric detail and finer paths — clean layers, editable output, and scalable to any size.

10 runs
Official
recraft-v4.1-svg

recraft-ai / recraft-v4.1-svg

Generate production-ready SVG vector images from text prompts. Recraft V4.1's design taste applied to vector output — clean geometry, structured layers, and editable paths.

21 runs
Official
recraft-v4.1-pro

recraft-ai / recraft-v4.1-pro

Recraft's latest image generation model at ~2048px resolution. Same design taste and prompt accuracy as V4.1, with higher resolution for print-ready and large-scale work.

95 runs
Official
recraft-v4.1

recraft-ai / recraft-v4.1

Recraft's latest image generation model, built around design taste. Strong prompt accuracy, art-directed composition, and integrated text rendering. Fast and cost-efficient at standard resolution.

545 runs
Official
grok-imagine-image-quality

xai / grok-imagine-image-quality

xAI's higher-quality image model with sharper details, better text rendering, and 2k output

7.6K runs
Official
scribe-v2

elevenlabs / scribe-v2

Transcribe speech with ElevenLabs Scribe v2. 90+ languages, word-level timestamps, speaker diarization for up to 32 speakers, audio event tagging, and keyterm biasing. Files up to 3 GB and 10 hours.

464 runs
Official
realtime-tts-2

inworld / realtime-tts-2

Most expressive text-to-speech model from Inworld, with natural-language steering, real-time latency, and multilingual support across 100+ languages.

1.4K runs
Official
clarity-pro-upscaler

philz1337x / clarity-pro-upscaler

The first creative upscaler which keeps identity. Stunning photorealistic results, realistic skin, and full creative control.

4.9K runs
Official
grok-text-to-speech

xai / grok-text-to-speech

Convert text to natural-sounding speech with xAI's Grok TTS. 5 voices, 20 languages, expressive speech tags, and high-fidelity MP3 / WAV / telephony audio output.

5.9K runs
Official
grok-speech-to-text

xai / grok-speech-to-text

Transcribe audio to text with xAI's Grok. Handles 25 languages, word-level timestamps, speaker diarization, multichannel audio, and files up to 500 MB.

548 runs
Official
granite-speech-4.1-2b

ibm-granite / granite-speech-4.1-2b

Granite Speech 4.1 2B is a compact and efficient speech-language model, specifically designed for multilingual automatic speech recognition (ASR) and bidirectional automatic speech translation (AST) for English, French, German, Spanish, Portuguese and Jap

29 runs
Official

alibaba / happyhorse-1.0

Alibaba's Happy Horse 1.0 generates videos from text prompts or animates a single image into video. Supports 720p and 1080p, 3-15 second durations, and five aspect ratios.

6.9K runs
Official
granite-embedding-small-english-r2

ibm-granite / granite-embedding-small-english-r2

Granite-embedding-small-english-r2 is a 47M parameter dense biencoder embedding model from the Granite Embeddings collection that can be used to generate high quality text embeddings.

19 runs
Official
granite-4.1-8b

ibm-granite / granite-4.1-8b

Granite-4.1-8B is a 8B parameter long-context instruct model finetuned from Granite-4.1-8B-Base using a combination of open source instruction datasets with permissive license and internally collected synthetic datasets.

2.9K runs
Official

pixverse / pixverse-v6

PixVerse's flagship video generation model. Generate cinematic videos with synchronized audio, multi-shot sequences, and precise camera control.

17K runs
Official
kimi-k2.6

moonshotai / kimi-k2.6

Moonshot AI's frontier open model, built for long-horizon coding, agent swarms, and autonomous software engineering. 1 trillion parameters, 262k context window, vision and tool use.

2.1K runs
Official
gpt-image-2

openai / gpt-image-2

OpenAI's state-of-the-art image generation model. Create and edit images from text with strong instruction following, sharp text rendering, and detailed editing.

2M runs
Official
create-character-v1

uthana / create-character-v1

Rig any 3D bipedal character mesh

59 runs
Official

OCR to extract text from images

Use AI For Optical Character Recognition (OCR) to extract text from images via API

Classify text

Classify text by sentiment, topic, intent, or safety

Create realistic face swaps

Replace faces across images with natural-looking results.

Vision models

Chat with images — visual Q&A, analysis, and reasoning via API

Caption Images

Use AI to generate captions and descriptions from images with an API

Edit your videos

Use AI to edit, restyle, extend, and remix videos with an API

WAN family of models

WAN family of models: open-source video, image, and audio generation

Create 3D content

Generate 3D objects, meshes, and textures from text or images with an API

Large Language Models (LLMs)

Explore Large Language Models (LLMs) for chat, generation & NLP tasks via API

Try AI models for free

Try AI Models for free: video generation, image generation, upscaling, and photo restoration

Object detection and segmentation

Use AI object detection and segmentation models to distinguish objects in images & videos

Flux fine-tunes

Flux fine-tunes: build and run custom AI image models via API

Media utilities

AI media utilities: auto-caption, watermark, frame extraction & more via API

Qwen-Image fine-tunes

Browse the diverse range of qwen-image fine-tunes the community has custom-trained on Replicate.

Latest models