Explore

Featured models

pixverse / pixverse-v4

Quickly generate smooth 5s or 8s videos at 540p, 720p or 1080p

minimax / speech-02-hd

Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.

8.3K runs

minimax / voice-cloning

Clone voices to use with Minimax's speech-02-hd and speech-02-turbo

680 runs

zsxkib / step1x-edit

✍️Step1X-Edit by stepfun-ai, Edit an image using text prompt📸

3.6K runs

ideogram-ai / ideogram-v3-balanced

Balance speed, quality and cost. Ideogram v3 creates images with stunning realism, creative designs, and consistent styles

6.6K runs

ideogram-ai / ideogram-v3-turbo

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

9.8K runs

ideogram-ai / ideogram-v3-quality

The highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles

18.1K runs

kwaivgi / kling-v2.0

Generate 5s and 10s videos in 720p resolution

8.5K runs

nvidia / sana-sprint-1.6b

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

112.1K runs

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

Get started Learn more

I want to…

Generate images

Models that generate images from text prompts

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Generate videos

Models that create and edit videos

Caption images

Models that generate text from images

Transcribe speech

Models that convert speech to text

Generate speech

Convert text to speech

Use handy tools

Toolbelt-type models for videos and images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Use a face to make images

Make realistic images of people instantly

Edit images

Tools for manipulating images.

Caption videos

Models that generate text from videos

Generate text

Models that can understand and generate text

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Remove backgrounds

Models that remove backgrounds from images and videos

Detect objects

Models that detect or segment objects in images and videos.

Generate music

Models to generate and modify music

Sing with voices

Voice-to-voice cloning and musical prosody

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Chat with images

Ask language models about images

Extract text from images

Optical character recognition (OCR) and text extraction

Get embeddings

Models that generate embeddings from inputs

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 1 month, 3 weeks ago 937.5M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 3 months, 1 week ago 19.9M runs

openai/whisper

Convert speech in audio to text

Updated 5 months, 2 weeks ago 85.5M runs

tencentarc/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 1 year, 1 month ago 89.3M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 7 months ago 29.2M runs

vaibhavs10/incredibly-fast-whisper

whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗

Updated 1 year, 2 months ago 5.9M runs

bytedance/hyper-flux-8step

Hyper FLUX 8-step by ByteDance

Updated 1 month, 3 weeks ago 11.3M runs

krthr/clip-embeddings

Generate CLIP (clip-vit-large-patch14) text & image embeddings

Updated 1 year, 8 months ago 27.9M runs

Latest models

lucataco/florence-2-base

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Updated 10 months, 2 weeks ago 61.3K runs

prakharsaxena24/2d-to-real-style

Updated 10 months, 2 weeks ago 493 runs

zsxkib/qwen2-7b-instruct

Qwen 2: A 7 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

Updated 10 months, 2 weeks ago 1.7K runs

zsxkib/qwen2-1.5b-instruct

Qwen 2: A 1.5 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

Updated 10 months, 2 weeks ago 217 runs

platform-kit/mars5-tts

A novel speech model for insane prosody.

Updated 10 months, 2 weeks ago 470 runs

zsxkib/qwen2-0.5b-instruct

Qwen 2: A 0.5 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

Updated 10 months, 2 weeks ago 199 runs

ardianfe/musicgen-ft

good for video teaser backsound

Updated 10 months, 2 weeks ago 58 runs

philz1337x/clarity-upscaler

High resolution image Upscaler and Enhancer. Use at ClarityAI.co. A free Magnific alternative. Twitter/X: @philz1337x

Updated 10 months, 2 weeks ago 13.5M runs

buddhiraz/photomaker_ape_stylized

Updated 10 months, 2 weeks ago 512 runs

zsxkib/sd3-controlnet

✨Stable Diffusion 3 w/ ⚡InstantX's Canny, Pose, and Tile ControlNets🖼️

Updated 10 months, 2 weeks ago 1.2K runs

fofr/sd3-explorer

A model for experimenting with all the SD3 settings. Non-commercial use only, unless you have a Stability AI Self Hosted License.

Updated 10 months, 2 weeks ago 32.2K runs

douwantech/gpt-sovits-train

Updated 10 months, 3 weeks ago 181 runs

stackadoc/stable-audio-open-1.0

Stable Audio Open is an open-source model optimized for generating short audio samples, sound effects, and production elements using text prompts.

Updated 10 months, 3 weeks ago 10.8K runs

fofr/sd3-with-chaos

Stable Diffusion 3 medium with added variability in outputs. Non-commercial use only, unless you have a Stability AI Self Hosted License.

Updated 10 months, 3 weeks ago 20.2K runs

xavriley/sax_transcription

Transcribe saxophone solos directly from audio

Updated 10 months, 3 weeks ago 185 runs

douwantech/musev

Updated 10 months, 3 weeks ago 347 runs

franz-biz/yolo-world-xl

Real-Time Open-Vocabulary Object Detection using the xl weights

Updated 10 months, 3 weeks ago 212.7K runs

charlesmccarthy/musicgen

MusicGen running on an a40 with 60 seconds max duration

Updated 10 months, 3 weeks ago 834 runs

magpai-app/cog-puppeteer

Updated 10 months, 3 weeks ago 171 runs

lucataco/mobius

Mobius, a diffusion model that pushes the boundaries of domain-agnostic debiasing and representation realignment

Updated 10 months, 3 weeks ago 621 runs

turian/dover-video-quality-assessment

DOVER video quality assessment tool, assigning videos both aesthetic and technical quality scores

Updated 10 months, 3 weeks ago 27 runs

dhanushreddy291/photo-background-generation

Generate Product photography backgrounds using Stable Diffusion

Updated 10 months, 3 weeks ago 520 runs

mareksagan/dreamgaussian

DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation. Hologram optimized

Updated 10 months, 4 weeks ago 343 runs

mtg/music-classifiers

Transfer learning models for music classification by genres, moods, and instrumentation

Updated 10 months, 4 weeks ago 10.2K runs

zsxkib/v-express

🫦 Realistic facial expression manipulation (lip-syncing) using audio or video

Updated 10 months, 4 weeks ago 1K runs

douwantech/musepose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation.

Updated 11 months ago 950 runs

ahmdyassr/mask-clothing

Super fast clothing (and face) segmentation and masking with erosion and dilation capability, made for https://outfit.fm

Updated 11 months ago 16.1K runs

charlesmccarthy/pony-sdxl

The best Pony-SDXL models! Current one is based on Pony Realism.

Updated 11 months ago 87.5K runs

buddhiraz/chilloutmix-ni-pruned-fp32-fixx

Updated 11 months ago 181 runs

remodela-ai/scaling-model-v1

# Interior Decoration Space Scaling - First Use Case

Updated 11 months ago 66 runs

zeke/hello-world

A tiny model for testing out Cog

Updated 11 months ago 1.1K runs

skytells-research/photomaker

Updated 11 months ago 8.2K runs

skytells-research/photo-stylizer

Updated 11 months ago 1.6K runs

fofr/consistent-character

Create images of a given character in different poses

Updated 11 months ago 974.7K runs

jinchanz/comfyui

Updated 11 months ago 172 runs

douwantech/musetalk

Real-Time High Quality Lip Synchronization with Latent Space Inpainting

Updated 11 months ago 2.5K runs

myaiteam2/zip2mp4

Turns 10 mp4 into 1

Updated 11 months, 1 week ago 72 runs

fermatresearch/sdxl-outpainting-lora

An improved outpainting model that supports LoRA urls. This model uses PatchMatch to improve the mask quality.

Updated 11 months, 1 week ago 80.6K runs

cuuupid/garden-state-llama

Llama-3-8B finetuned with ReFT to hyperfocus on New Jersey, the Garden State, the best state, the only state!

Updated 11 months, 1 week ago 106 runs

zsxkib/hololive-style-bert-vits2

🎙️Hololive text-to-speech and voice-to-voice (Japanese🇯🇵 + English🇬🇧)

Updated 11 months, 1 week ago 854 runs

prakharsaxena24/masked-upscaler

Upscaler and detailer for a selected area

Updated 11 months, 1 week ago 4.8K runs

chenxwh/omost

Convert LLM's coding to image generation

Updated 11 months, 1 week ago 1.9K runs

charlesmccarthy/epicrealism-v7

epiCRealism v7-Final Destination. Top Realism Model on Civitai

Updated 11 months, 1 week ago 1.7K runs

charlesmccarthy/anima_pencil-xl

blue_pencil-XL meets ANIMAGINE XL 3.0 / ANIMAGINE XL 3.1, The top ranked model on Civitai

Updated 11 months, 1 week ago 3.8K runs

charlesmccarthy/cog-iniverse

Updated 11 months, 1 week ago 118 runs

thlz998/chat-tts

This is an implementation of the ChatTTS as a Cog model.

Updated 11 months, 1 week ago 3.1K runs

cjwbw/sadtalker

Stylized Audio-Driven Single Image Talking Face Animation

Updated 11 months, 1 week ago 130.4K runs

johnsutor/emoji-painter

Recreate images with Emojis

Updated 11 months, 1 week ago 203 runs

ji4chenli/t2v-turbo

Fast and High-Quality Text-to-video Generation

Updated 11 months, 1 week ago 4.6K runs

lectralab/fashion-style-tranfer

A PhotoBooth style transfer workflow that utilizes IPadapter Style, Canny, OpenPose, RemoveBackground, HumanSegmentation, Cloth Segmentation for initial input, and concludes with the application of DeepFake techniques.

Updated 11 months, 1 week ago 181 runs

Featured models

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-pylon-widget": false, "show-open-ai-api-instructions": false}} pixverse / pixverse-v4

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-pylon-widget": false, "show-open-ai-api-instructions": false}} minimax / speech-02-hd

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-pylon-widget": false, "show-open-ai-api-instructions": false}} minimax / voice-cloning