Explore

Featured models

pixverse / pixverse-v4

Quickly generate smooth 5s or 8s videos at 540p, 720p or 1080p

minimax / speech-02-hd

Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.

6.9K runs

minimax / voice-cloning

Clone voices to use with Minimax's speech-02-hd and speech-02-turbo

630 runs

zsxkib / step1x-edit

✍️Step1X-Edit by stepfun-ai, Edit an image using text prompt📸

3.3K runs

ideogram-ai / ideogram-v3-balanced

Balance speed, quality and cost. Ideogram v3 creates images with stunning realism, creative designs, and consistent styles

5.9K runs

ideogram-ai / ideogram-v3-turbo

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

9.2K runs

ideogram-ai / ideogram-v3-quality

The highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles

16.9K runs

kwaivgi / kling-v2.0

Generate 5s and 10s videos in 720p resolution

8.2K runs

nvidia / sana-sprint-1.6b

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

109.8K runs

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

Get started Learn more

I want to…

Generate images

Models that generate images from text prompts

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Generate videos

Models that create and edit videos

Caption images

Models that generate text from images

Transcribe speech

Models that convert speech to text

Generate speech

Convert text to speech

Use handy tools

Toolbelt-type models for videos and images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Use a face to make images

Make realistic images of people instantly

Edit images

Tools for manipulating images.

Caption videos

Models that generate text from videos

Generate text

Models that can understand and generate text

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Remove backgrounds

Models that remove backgrounds from images and videos

Detect objects

Models that detect or segment objects in images and videos.

Generate music

Models to generate and modify music

Sing with voices

Voice-to-voice cloning and musical prosody

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Chat with images

Ask language models about images

Extract text from images

Optical character recognition (OCR) and text extraction

Get embeddings

Models that generate embeddings from inputs

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 1 month, 3 weeks ago 935.9M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 3 months, 1 week ago 19.5M runs

openai/whisper

Convert speech in audio to text

Updated 5 months, 2 weeks ago 85.3M runs

bytedance/hyper-flux-8step

Hyper FLUX 8-step by ByteDance

Updated 1 month, 3 weeks ago 11.3M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 7 months ago 29.1M runs

philz1337x/clarity-upscaler

High resolution image Upscaler and Enhancer. Use at ClarityAI.co. A free Magnific alternative. Twitter/X: @philz1337x

Updated 10 months, 2 weeks ago 13.5M runs

tencentarc/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 1 year, 1 month ago 89.3M runs

krthr/clip-embeddings

Generate CLIP (clip-vit-large-patch14) text & image embeddings

Updated 1 year, 8 months ago 27.9M runs

Latest models

jhorovitz/omini-schnell

Place items in a scene without needing to train on them

Updated 2 months, 3 weeks ago 2.5K runs

jhorovitz/omini-dev

Cogified implementation of OminiControl

Updated 2 months, 3 weeks ago 74 runs

moonpig/dis-background-removal

Updated 2 months, 3 weeks ago 81 runs

mtg/music-arousal-valence

Regression of musical arousal and valence values

Updated 2 months, 3 weeks ago 8.8K runs

lucataco/step-audio-tts-3b

Step-Audio-TTS-3B represents the industry's first Text-to-Speech (TTS) model trained on a large-scale synthetic dataset utilizing the LLM-Chat paradigm

Updated 2 months, 3 weeks ago 1.1K runs

ocg2347/plksr-tiled-lowvram

Tiled inference implementation of PLKSR

Updated 2 months, 3 weeks ago 69 runs

ttsds/speecht5

Updated 2 months, 3 weeks ago 167 runs

lucataco/videollama3-7b

VideoLLaMA 3: Frontier Multimodal Foundation Models for Video Understanding

Updated 2 months, 3 weeks ago 1.9K runs

ostris/flex.1-alpha

Flex.1 alpha is a pre-trained base 8 billion parameter rectified flow transformer capable of generating images from text descriptions

Updated 2 months, 3 weeks ago 300 runs

tmappdev/change_video_bg

Change or Replace Video Background with any Image

Updated 2 months, 3 weeks ago 252 runs

jaaari/zonos

Zonos-v0.1 by Zyphra, voice cloning, 5 languages and emotion control

Updated 2 months, 3 weeks ago 1.2K runs

deepseek-ai/janus-pro-1b

Janus-Pro is a novel autoregressive framework for multimodal understanding

Updated 2 months, 4 weeks ago 6.7K runs

ttsds/pheme

Updated 2 months, 4 weeks ago 651 runs

anthropic/claude-3.5-haiku

Anthropic's fastest, most cost-effective model, with a 200K token context window (claude-3-5-haiku-20241022)

Updated 2 months, 4 weeks ago 906.9K runs

anthropic/claude-3.5-sonnet

Anthropic's most intelligent language model to date, with a 200K token context window and image understanding (claude-3-5-sonnet-20241022)

Updated 2 months, 4 weeks ago 475.6K runs

mmezhov/catvton-flux

Updated 2 months, 4 weeks ago 303 runs

subhash25rawat/morphix3d

Transform Images & Text into 3D Models with AI

Updated 2 months, 4 weeks ago 44 runs

deepseek-ai/deepseek-vl2

DeepSeek-VL2, an advanced series of large Mixture-of-Experts (MoE) Vision-Language Models that significantly improves upon its predecessor, DeepSeek-VL

Updated 2 months, 4 weeks ago 52.4K runs

deepseek-ai/deepseek-vl2-small

DeepSeek-VL2-small, an advanced series of large Mixture-of-Experts (MoE) Vision-Language Models that significantly improves upon its predecessor, DeepSeek-VL

Updated 2 months, 4 weeks ago 681 runs

cuuupid/zonos

Zonos-v0.1 beta, a SOTA text-to-speech Transformer model with extraordinary expressive range, built by Zyphra.

Updated 3 months ago 254 runs

lucataco/dotted-video

Converts a video into a black and white dotted video effect

Updated 3 months ago 749 runs

zsxkib/hibiki

Hibiki: High-Fidelity Simultaneous Speech-To-Speech Translation

Updated 3 months ago 12 runs

tencent/hunyuan3d-2

Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Updated 3 months ago 1.2K runs

ttsds/parlertts_indic

Updated 3 months ago 208 runs

ttsds/parlertts_tiny_1_0

Updated 3 months ago 199 runs

alphanumericuser/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 3 months ago 2.1K runs

ttsds/parlertts_mini_multilingual

Updated 3 months ago 197 runs

ttsds/parlertts_mini_expresso

Updated 3 months ago 195 runs

zetyquickly-org/faceswap-a-gif

Make Fun by Changing Face on a GIF!

Updated 3 months ago 42.5K runs

google/imagen-3-fast

A faster and cheaper Imagen 3 model, for when price or speed are more important than final image quality

Updated 3 months ago 108.6K runs

google/imagen-3

Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty

Updated 3 months ago 803.4K runs

ttsds/parlertts_mini_1_1_fixed

Updated 3 months ago 191 runs

ttsds/parlertts_large_1_0

Updated 3 months ago 214 runs

ttsds/parlertts_mini_1_0

Updated 3 months ago 192 runs

delta-lock/noobai-xl

https://civitai.com/models/833294

Updated 3 months ago 28.4K runs

alexfernandez803/rembgplus

Rembg implementation with mask output

Updated 3 months ago 45 runs

ttsds/parlertts_mini_0_1

Updated 3 months ago 196 runs

deepseek-ai/janus-pro-7b

Janus-Pro is a novel autoregressive framework for multimodal understanding

Updated 3 months, 1 week ago 10.7K runs

fofr/yue

Generate music with YuE-s1-7B (English, chain of thought model)

Updated 3 months, 1 week ago 984 runs

jbilcke/oute-tts

Test deployment of OuteTTS 500M

Updated 3 months, 1 week ago 503 runs

rocketdigitalai/interior-design-sdxl-lightning

Interior Design with RealVisXL V5.0-Lightning and ControlNet to generate photorealistic, high-resolution interior designs.

Updated 3 months, 1 week ago 693 runs

rocketdigitalai/animagine-xl-4.0

Ultimate anime-themed finetuned SDXL model and the latest installment of the Animagine XL series

Updated 3 months, 1 week ago 668 runs

rocketdigitalai/interior-design-sdxl

Interior Design with RealVisXL V5.0 and ControlNet (Depth & Union SDXL ProMax) to generate photorealistic, high-resolution interior designs with enhanced depth and structure.

Updated 3 months, 1 week ago 911 runs

zsxkib/star

STAR Video Upscaler: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Updated 3 months, 1 week ago 437 runs

ttsds/openvoice_2

Updated 3 months, 1 week ago 529 runs

cureau/force-align-wordstamps

Takes audio (mp3) and a "source-of-truth" audio transcript (string) as input and returns precise timestamps.

Updated 3 months, 1 week ago 822 runs

ttsds/metavoice

Updated 3 months, 1 week ago 604 runs

zeke/ai-ci-cd-example

A demo model for a guide I'm working on...

Updated 3 months, 1 week ago 8 runs

edoproch/deepseekr1-distilled-llama-70b-ollama

DeepSeek-R1 distilled on LLaMA3.3 70B and quantized by ollama

Updated 3 months, 1 week ago 23 runs

ttsds/f5

Updated 3 months, 1 week ago 1.3K runs

Featured models

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-open-ai-api-instructions": false, "show-pylon-widget": false}} pixverse / pixverse-v4

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-open-ai-api-instructions": false, "show-pylon-widget": false}} minimax / speech-02-hd

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-open-ai-api-instructions": false, "show-pylon-widget": false}} minimax / voice-cloning

zsxkib / step1x-edit

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-open-ai-api-instructions": false, "show-pylon-widget": false}} ideogram-ai / ideogram-v3-balanced

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-open-ai-api-instructions": false, "show-pylon-widget": false}} ideogram-ai / ideogram-v3-turbo

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-open-ai-api-instructions": false, "show-pylon-widget": false}} ideogram-ai / ideogram-v3-quality

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-open-ai-api-instructions": false, "show-pylon-widget": false}} kwaivgi / kling-v2.0