Explore

Featured models

pixverse / pixverse-v4

Quickly generate smooth 5s or 8s videos at 540p, 720p or 1080p

minimax / speech-02-hd

Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.

8.1K runs

minimax / voice-cloning

Clone voices to use with Minimax's speech-02-hd and speech-02-turbo

667 runs

zsxkib / step1x-edit

✍️Step1X-Edit by stepfun-ai, Edit an image using text prompt📸

3.5K runs

ideogram-ai / ideogram-v3-balanced

Balance speed, quality and cost. Ideogram v3 creates images with stunning realism, creative designs, and consistent styles

6.2K runs

ideogram-ai / ideogram-v3-turbo

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

9.6K runs

ideogram-ai / ideogram-v3-quality

The highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles

17.6K runs

kwaivgi / kling-v2.0

Generate 5s and 10s videos in 720p resolution

8.4K runs

nvidia / sana-sprint-1.6b

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

111K runs

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

Get started Learn more

I want to…

Generate images

Models that generate images from text prompts

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Generate videos

Models that create and edit videos

Caption images

Models that generate text from images

Transcribe speech

Models that convert speech to text

Generate speech

Convert text to speech

Use handy tools

Toolbelt-type models for videos and images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Use a face to make images

Make realistic images of people instantly

Edit images

Tools for manipulating images.

Caption videos

Models that generate text from videos

Generate text

Models that can understand and generate text

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Remove backgrounds

Models that remove backgrounds from images and videos

Detect objects

Models that detect or segment objects in images and videos.

Generate music

Models to generate and modify music

Sing with voices

Voice-to-voice cloning and musical prosody

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Chat with images

Ask language models about images

Extract text from images

Optical character recognition (OCR) and text extraction

Get embeddings

Models that generate embeddings from inputs

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 1 month, 3 weeks ago 936.8M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 3 months, 1 week ago 19.8M runs

openai/whisper

Convert speech in audio to text

Updated 5 months, 2 weeks ago 85.4M runs

yorickvp/llava-13b

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities

Updated 9 months, 3 weeks ago 26.3M runs

bytedance/hyper-flux-8step

Hyper FLUX 8-step by ByteDance

Updated 1 month, 3 weeks ago 11.3M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 7 months ago 29.1M runs

stability-ai/sdxl

A text-to-image generative AI model that creates beautiful images

Updated 11 months, 2 weeks ago 78.9M runs

andreasjansson/blip-2

Answers questions about images

Updated 1 year, 5 months ago 29.8M runs

Latest models

meta/meta-llama-3-70b-instruct

A 70 billion parameter language model from Meta, fine tuned for chat completions

Updated 1 year ago 151.9M runs

meta/meta-llama-3-8b-instruct

An 8 billion parameter language model from Meta, fine tuned for chat completions

Updated 1 year ago 358.5M runs

meta/meta-llama-3-8b

Base version of Llama 3, an 8 billion parameter language model from Meta.

Updated 1 year ago 50.9M runs

jordillull/acme-assistant

An example of a rudimentary Q&A assistant for ACME SL

Updated 1 year ago 10 runs

camenduru/zest

ZeST: Zero-Shot Material Transfer from a Single Image

Updated 1 year ago 1.5K runs

camenduru/instantmesh

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Updated 1 year ago 42K runs

cjwbw/parler-tts

lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data

Updated 1 year ago 2.5K runs

camenduru/zephyr-orpo-141b-a35b-v0.1

Mixtral 8x22b v0.1 Zephyr Orpo 141b A35b v0.1

Updated 1 year ago 137 runs

cjwbw/pixart-sigma

Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Updated 1 year ago 6.6K runs

camenduru/magictime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Updated 1 year ago 368 runs

holywalley/stt_be_ctc

Updated 1 year ago 76 runs

nateraw/musicgen-songstarter-v0.2

A large, stereo MusicGen that acts as a useful tool for music producers

Updated 1 year ago 4K runs

camenduru/mixtral-8x22b-v0.1-4bit

Mixtral-8x22b-v0.1-4bit

Updated 1 year ago 362 runs

manu-sapiens/python-pptx

Use a subset of https://github.com/barun-saha/slide-deck-ai to create powerpoint slides from a json description - using python-pptx (https://github.com/scanny/python-pptx)

Updated 1 year, 1 month ago 287 runs

ai-forever/kandinsky-2.2

multilingual text2image latent diffusion model

Updated 1 year, 1 month ago 10M runs

camenduru/streaming-t2v

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Updated 1 year, 1 month ago 4K runs

fofr/face-to-many

Turn a face into 3D, emoji, pixel art, video game, claymation or toy

Updated 1 year, 1 month ago 13.3M runs

camenduru/emage

EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling

Updated 1 year, 1 month ago 103 runs

jyoung105/instant-style-control

Free Lunch towards Style-Preserving in Text-to-Image Generation by InstantX team, with ControlNet

Updated 1 year, 1 month ago 551 runs

jyoung105/instant-style

Free Lunch towards Style-Preserving in Text-to-Image Generation by InstantX team

Updated 1 year, 1 month ago 1.9K runs

mrhan1993/fooocus-api

Updated 1 year, 1 month ago 995.2K runs

camenduru/minigpt4-video

MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens

Updated 1 year, 1 month ago 824 runs

yxzwayne/bge-reranker-v2-m3

Newest balance-striking reranker model from BAAI. Outputs rank scores for query-doc pairs. FP16 inference enabled.

Updated 1 year, 1 month ago 61.4K runs

camenduru/open-sora-plan-512x512

Open Sora Plan Text To Video

Updated 1 year, 1 month ago 1.7K runs

bytedance/res-adapter

Domain Consistent Resolution Adapter for Diffusion Models: generating consistent images with resolutions outside of their trained domain

Updated 1 year, 1 month ago 1.4K runs

jinchanz/sticker

sticker maker fork from fofr

Updated 1 year, 1 month ago 785 runs

adirik/interior-design

Realistic interior design with text and image inputs

Updated 1 year, 1 month ago 825K runs

leclem/seine-transition

Generate a video transitioning from one image to another using SEINE model

Updated 1 year, 1 month ago 2.3K runs

yxzwayne/bge-reranker-v2-gemma

Outputs a relevance/similarity score or a list of scores for a pair or pairs of string data. FP16 enabled.

Updated 1 year, 1 month ago 113 runs

myaiteam2/album-video-maker

Updated 1 year, 1 month ago 79 runs

vectradmin/sdxl-v-transparent

Updated 1 year, 1 month ago 150.6K runs

bryantanjw/entropy-lol

LoRA + Iterative 4x Upscale ComfyUI Workflow

Updated 1 year, 1 month ago 3.1K runs

re-mix-1/sdxl-lightning-8step

An implementation of ByteDance/SDXL-Lightning-8step

Updated 1 year, 1 month ago 555 runs

shreejalmaharjan-27/website-screenshot

Capture a website screenshot

Updated 1 year, 1 month ago 940.4K runs

xiankgx/sdxl-evolution-0.1

Genetic algorithm like mixing of SDXL models

Updated 1 year, 1 month ago 900 runs

cjwbw/aniportrait-audio2vid

Audio-Driven Synthesis of Photorealistic Portrait Animations

Updated 1 year, 1 month ago 14.3K runs

muqtadar08/image_to_text

Updated 1 year, 1 month ago 25 runs

camenduru/attribute-control

Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions

Updated 1 year, 1 month ago 106 runs

camenduru/arc2face

Arc2Face: A Foundation Model of Human Faces

Updated 1 year, 1 month ago 1.5K runs

camenduru/grm

GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation

Updated 1 year, 1 month ago 603 runs

astramlco/diffbir

DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Updated 1 year, 1 month ago 503 runs

zylim0702/remove_bg

Best Human detection and Object Detection Background removal.

Updated 1 year, 1 month ago 9.1K runs

tiger-ai-lab/anyv2v

Tuning-free framework to achieve high appearance and temporal consistency in video editing

Updated 1 year, 1 month ago 980 runs

spuuntries/erosumika-7b-v3-0.2-gguf

localfultonextractor's Erosumika 7B Mistral Merge, GGUF Q4_K_S-imat quantized by Lewdiculous.

Updated 1 year, 1 month ago 929 runs

meta/musicgen

Generate music from a prompt or melody

Updated 1 year, 1 month ago 2.4M runs

lucataco/sdxs-512-0.9

sdxs-512-0.9 can generate high-resolution images in real-time based on prompt texts, trained using score distillation and feature matching

Updated 1 year, 1 month ago 18.8K runs

camenduru/aniportrait-vid2vid

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Updated 1 year, 1 month ago 4.4K runs

hamelsmu/honeycomb-4-awq

Honeycomb NLQ Generator hosted with vLLM + AWQ Quantized

Updated 1 year, 1 month ago 102 runs

fermatresearch/sdxl-controlnet-lora-inpaint

Good old controlnet + inpaint + lora

Updated 1 year, 1 month ago 1.5K runs

smoretalk/rembg-enhance

A background removal model enhanced with better matting

Updated 1 year, 1 month ago 5M runs

Featured models

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-pylon-widget": false, "show-open-ai-api-instructions": false}} pixverse / pixverse-v4

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-pylon-widget": false, "show-open-ai-api-instructions": false}} minimax / speech-02-hd

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-pylon-widget": false, "show-open-ai-api-instructions": false}} minimax / voice-cloning

zsxkib / step1x-edit

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-pylon-widget": false, "show-open-ai-api-instructions": false}} ideogram-ai / ideogram-v3-balanced

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-pylon-widget": false, "show-open-ai-api-instructions": false}} ideogram-ai / ideogram-v3-turbo

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-pylon-widget": false, "show-open-ai-api-instructions": false}} ideogram-ai / ideogram-v3-quality

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-pylon-widget": false, "show-open-ai-api-instructions": false}} kwaivgi / kling-v2.0