Explore

Featured models

pixverse / pixverse-v4

Quickly generate smooth 5s or 8s videos at 540p, 720p or 1080p

minimax / speech-02-hd

Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.

7K runs

minimax / voice-cloning

Clone voices to use with Minimax's speech-02-hd and speech-02-turbo

649 runs

zsxkib / step1x-edit

✍️ Step1X-Edit by stepfun-ai: edit an image using a text prompt 📸

3.4K runs

ideogram-ai / ideogram-v3-balanced

Balance speed, quality and cost. Ideogram v3 creates images with stunning realism, creative designs, and consistent styles

6K runs

ideogram-ai / ideogram-v3-turbo

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

9.3K runs

ideogram-ai / ideogram-v3-quality

The highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles

17.1K runs

kwaivgi / kling-v2.0

Generate 5s and 10s videos in 720p resolution

8.2K runs

nvidia / sana-sprint-1.6b

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

110.5K runs

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

I want to…

Generate images

Models that generate images from text prompts

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Generate videos

Models that create and edit videos

Caption images

Models that generate text from images

Transcribe speech

Models that convert speech to text

Generate speech

Convert text to speech

Use handy tools

Toolbelt-type models for videos and images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Use a face to make images

Make realistic images of people instantly

Edit images

Tools for manipulating images.

Caption videos

Models that generate text from videos

Generate text

Models that can understand and generate text

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Remove backgrounds

Models that remove backgrounds from images and videos

Detect objects

Models that detect or segment objects in images and videos.

Generate music

Models to generate and modify music

Sing with voices

Voice-to-voice cloning and musical prosody

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Chat with images

Ask language models about images

Extract text from images

Optical character recognition (OCR) and text extraction

Get embeddings

Models that generate embeddings from inputs

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 1 month, 3 weeks ago 936.2M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 3 months, 1 week ago 19.6M runs

openai/whisper

Convert speech in audio to text

Updated 5 months, 2 weeks ago 85.4M runs

yorickvp/llava-13b

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities

Updated 9 months, 3 weeks ago 26.3M runs

bytedance/hyper-flux-8step

Hyper FLUX 8-step by ByteDance

Updated 1 month, 3 weeks ago 11.3M runs

andreasjansson/blip-2

Answers questions about images

Updated 1 year, 5 months ago 29.8M runs

lucataco/qwen-vl-chat

A multimodal LLM-based AI assistant, which is trained with alignment techniques. Qwen-VL-Chat supports more flexible interaction, such as multi-round question answering, and creative capabilities.

Updated 1 year, 6 months ago 813.6K runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 7 months ago 29.1M runs

Latest models

chenxwh/cosyvoice2-0.5b

Scalable Streaming Speech Synthesis with Large Language Models

Updated 4 months, 2 weeks ago 4.8K runs

fire/flux

Updated 4 months, 2 weeks ago 39 runs

foundationvision/infinity

Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Updated 4 months, 2 weeks ago 381 runs

meta/llama-guard-3-11b-vision

A Llama-3.2-11B pretrained model, fine-tuned for content safety classification

Updated 4 months, 2 weeks ago 663 runs

ardianfe/demucs-prod

Sound separation with Demucs

Updated 4 months, 2 weeks ago 44.1K runs

alexgenovese/flux-sd3-flow-edit

FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models

Updated 4 months, 2 weeks ago 398 runs

ahm3texe/test999

Updated 4 months, 2 weeks ago 5 runs

meta/llama-guard-3-8b

A Llama-3.1-8B pretrained model, fine-tuned for content safety classification

Updated 4 months, 2 weeks ago 33.1K runs

meta/llamaguard-7b

A 7B parameter Llama 2-based input-output safeguard model

Updated 4 months, 2 weeks ago 21 runs

daanelson/flux-fill-dev-big

Image inpainting with FLUX

Updated 4 months, 2 weeks ago 48 runs

lucataco/qwen2-vl-7b-instruct

Latest model in the Qwen family for chatting with video and image models

Updated 4 months, 2 weeks ago 41.1K runs

851-labs/background-remover

Remove backgrounds from images.

Updated 4 months, 3 weeks ago 2M runs

ibm-granite/granite-3.1-8b-instruct

Granite-3.1-8B-Instruct is a lightweight and open-source 8B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

Updated 4 months, 3 weeks ago 764.2K runs

ibm-granite/granite-3.1-2b-instruct

Granite-3.1-2B-Instruct is a lightweight and open-source 2B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

Updated 4 months, 3 weeks ago 9.1K runs

jzhang38/fast-hunyuan-video

Fast Hunyuan Video by Hao AI Lab

Updated 4 months, 3 weeks ago 433 runs

ryan5453/demucs

Demucs is an audio source separator created by Facebook Research.

Updated 4 months, 3 weeks ago 405.7K runs

ahm3texe/blur

Description Test

Updated 4 months, 3 weeks ago 8 runs

arthur630-tech/mob

Updated 4 months, 3 weeks ago 1.2K runs

jzhang38/fast-mochi

Fast Mochi by Hao AI Lab

Updated 4 months, 3 weeks ago 58 runs

minimax/music-01

Quickly generate up to 1 minute of music with lyrics and vocals in the style of a reference track

Updated 4 months, 3 weeks ago 172K runs

toanbarcelona1998/dream_shaper8

Updated 4 months, 3 weeks ago 8 runs

lucataco/ollama-llama3.2-vision-90b

Ollama Llama 3.2 Vision 90B

Updated 4 months, 3 weeks ago 2.3K runs

vetkastar/comfy-flux

Comfy with the FLUX model

Updated 4 months, 3 weeks ago 172.9K runs

lucataco/ollama-llama3.2-vision-11b

Ollama Llama 3.2 Vision 11B

Updated 4 months, 3 weeks ago 1.8K runs

lucataco/ollama-qwq

Ollama QwQ 32B

Updated 4 months, 3 weeks ago 55 runs

asyasyarif/ootd_masking

Clothing segmentation tool that generates masks from outfit images, separating them into top and bottom pieces with automatic background removal and edge refinement.

Updated 4 months, 3 weeks ago 53 runs

lucataco/ollama-llama3.3-70b

Ollama Llama 3.3 70B

Updated 4 months, 3 weeks ago 13.6K runs

jyoung105/hyper-sdxl

Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis

Updated 4 months, 3 weeks ago 122 runs

lucataco/apollo-7b

Apollo 7B - An Exploration of Video Understanding in Large Multimodal Models

Updated 4 months, 3 weeks ago 3.8K runs

minimax/video-01-live

An image-to-video (I2V) model specifically trained for Live2D and general animation use cases

Updated 4 months, 3 weeks ago 102.6K runs

lucataco/apollo-3b

Apollo 3B - An Exploration of Video Understanding in Large Multimodal Models

Updated 4 months, 3 weeks ago 112 runs

lucataco/rembg-video

Video Background Removal

Updated 4 months, 3 weeks ago 1.9K runs

turian/arxiv-llm-text

Prepare arXiv papers for processing by Large Language Models (LLMs) by converting them into a single, expanded LaTeX file.

Updated 4 months, 3 weeks ago 18 runs

zsyoaoa/invsr

Arbitrary-steps Image Super-resolution via Diffusion Inversion

Updated 4 months, 3 weeks ago 2.9K runs

lucataco/bulk-video-caption

Video Preprocessing tool for captioning multiple videos using GPT, Claude or Gemini

Updated 4 months, 3 weeks ago 112 runs

lucataco/video-split

Simple tool to split apart a video into snippets

Updated 4 months, 3 weeks ago 122 runs

subhash25rawat/logo-in-context

Create ads for marketing and social media with your own company logo on any object you want.

Updated 4 months, 3 weeks ago 308 runs

luma/ray

Fast, high-quality text-to-video and image-to-video (also known as Dream Machine)

Updated 4 months, 4 weeks ago 30.7K runs

genmoai/mochi-1-lora-trainer

a-r-r-o-w/cogvideox-factory for Mochi-1 LoRA Training

Updated 4 months, 4 weeks ago 28 runs

zsxkib/instant-id

Make realistic images of real people instantly

Updated 4 months, 4 weeks ago 894.6K runs

zsxkib/hunyuan-video2video

A state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions

Updated 5 months ago 2.4K runs

zsxkib/memo

MEMO is a state-of-the-art open-weight model for audio-driven talking video generation.

Updated 5 months ago 729 runs

impetusdesign/rqi-txc-itp

Updated 5 months ago 359 runs

zurk/hunyuan-video-8bit

Hunyuan Video 8bit model API for video generation

Updated 5 months ago 238 runs

chenxwh/nitrofusion

High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training

Updated 5 months ago 151 runs

nvidia/sana

A fast image model with wide artistic range and resolutions up to 4096x4096

Updated 5 months ago 142.9K runs

lucataco/moondream-0.5b

Moondream 0.5B, the world's smallest vision language model

Updated 5 months ago 49 runs

qubit999/qwen2.5-coder-32b-instruct

The Qwen2.5-Coder-32B-Instruct is a state-of-the-art, open-source large language model (LLM). It is specifically designed for coding tasks and is part of the Qwen2.5-Coder series, featuring 32 billion parameters.

Updated 5 months ago 49 runs

luma/photon-flash

Accelerated variant of Photon prioritizing speed while maintaining quality

Updated 5 months ago 65K runs

luma/photon

High-quality image generation model optimized for creative professional workflows and ultra-high fidelity outputs

Updated 5 months ago 373.5K runs
