Explore

Featured models

pixverse / pixverse-v4

Quickly generate smooth 5s or 8s videos at 540p, 720p or 1080p

minimax / speech-02-hd

Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.

8.1K runs

minimax / voice-cloning

Clone voices to use with Minimax's speech-02-hd and speech-02-turbo

667 runs

zsxkib / step1x-edit

✍️ Step1X-Edit by stepfun-ai: edit an image using a text prompt 📸

3.5K runs

ideogram-ai / ideogram-v3-balanced

Balance speed, quality and cost. Ideogram v3 creates images with stunning realism, creative designs, and consistent styles

6.2K runs

ideogram-ai / ideogram-v3-turbo

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

9.6K runs

ideogram-ai / ideogram-v3-quality

The highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles

17.5K runs

kwaivgi / kling-v2.0

Generate 5s and 10s videos in 720p resolution

8.4K runs

nvidia / sana-sprint-1.6b

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

110.9K runs

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

I want to…

Generate images

Models that generate images from text prompts

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Generate videos

Models that create and edit videos

Caption images

Models that generate text from images

Transcribe speech

Models that convert speech to text

Generate speech

Convert text to speech

Use handy tools

Toolbelt-type models for videos and images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Use a face to make images

Make realistic images of people instantly

Edit images

Tools for manipulating images.

Caption videos

Models that generate text from videos

Generate text

Models that can understand and generate text

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Remove backgrounds

Models that remove backgrounds from images and videos

Detect objects

Models that detect or segment objects in images and videos.

Generate music

Models to generate and modify music

Sing with voices

Voice-to-voice cloning and musical prosody

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Chat with images

Ask language models about images

Extract text from images

Optical character recognition (OCR) and text extraction

Get embeddings

Models that generate embeddings from inputs

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 1 month, 3 weeks ago 936.8M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 3 months, 1 week ago 19.8M runs

openai/whisper

Convert speech in audio to text

Updated 5 months, 2 weeks ago 85.4M runs

yorickvp/llava-13b

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities

Updated 9 months, 3 weeks ago 26.3M runs

bytedance/hyper-flux-8step

Hyper FLUX 8-step by ByteDance

Updated 1 month, 3 weeks ago 11.3M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 7 months ago 29.1M runs

stability-ai/sdxl

A text-to-image generative AI model that creates beautiful images

Updated 11 months, 2 weeks ago 78.9M runs

andreasjansson/blip-2

Answers questions about images

Updated 1 year, 5 months ago 29.8M runs

Latest models

baaivision/emu3-gen

Emu3-Gen for image generation

Updated 7 months, 1 week ago 42 runs

pikachupichu25/image-faceswap

Updated 7 months, 2 weeks ago 5.5K runs

zsxkib/molmo-7b

allenai/Molmo-7B-D-0924: answers questions about images and captions them

Updated 7 months, 2 weeks ago 99.6K runs

zsxkib/flux-music

🎼FluxMusic Text-to-Music Generation with Rectified Flow Transformer🎶

Updated 7 months, 2 weeks ago 1.4K runs

justmalhar/meta-llama-3.2-3b

Meta Llama 3.2 3B

Updated 7 months, 2 weeks ago 1.6K runs

justmalhar/meta-llama-3.2-1b

Meta Llama 3.2 1B

Updated 7 months, 2 weeks ago 182 runs

okaris/omni-zero-couples

Omni-Zero Couples: A diffusion pipeline for zero-shot stylized couples portrait creation.

Updated 7 months, 2 weeks ago 5.5K runs

aleksanderobuchowski/bielik-11b-v2.3-instruct

Bielik-11B-v2.3-Instruct is a generative text model made by SpeakLeash and Cyfronet featuring 11 billion parameters. It is a linear merge of the Bielik-11B-v2.0-Instruct, Bielik-11B-v2.1-Instruct, and Bielik-11B-v2.2-Instruct models.

Updated 7 months, 2 weeks ago 1.2K runs

juanfranem/ip-adapter-full-face

Implementation of tencent-ailab/IP-Adapter with ip-adapter-plus-face_sd15

Updated 7 months, 2 weeks ago 156 runs

chenxwh/cogvlm2-video

CogVLM2: Visual Language Models for Image and Video Understanding

Updated 7 months, 2 weeks ago 652.6K runs

chenxwh/cogvlm2

CogVLM2: Visual Language Models for Image and Video Understanding

Updated 7 months, 2 weeks ago 621 runs

fofr/expression-editor

Quickly edit the expression of a face

Updated 7 months, 2 weeks ago 70.5K runs

ictnlp/llama-omni

Seamless Speech Interaction with Large Language Models

Updated 7 months, 2 weeks ago 58.4K runs

lucataco/ollama-qwen2.5-72b

Ollama Qwen2.5 72b

Updated 7 months, 2 weeks ago 719 runs

thudm/cogvideox-i2v

Image-to-Video Diffusion Models with An Expert Transformer

Updated 7 months, 2 weeks ago 909 runs

thudm/cogvideox-t2v

Text-to-Video Diffusion Models with An Expert Transformer

Updated 7 months, 2 weeks ago 249 runs

fofr/flux-dev-layers

Explore how Flux Dev responds when you change the strengths of layers in the model. See readme for examples of how to select layers.

Updated 7 months, 2 weeks ago 8.5K runs

lucataco/joy-caption-pre-alpha

Image Caption model

Updated 7 months, 3 weeks ago 377 runs

zsxkib/flux-dev-inpainting-controlnet

FLUX.1-dev Inpainting ControlNet model

Updated 7 months, 3 weeks ago 7.7K runs

erayyavuz/interior-ai

Create lifelike interior designs with AI from text descriptions and image references.

Updated 7 months, 3 weeks ago 2.5K runs

fermatresearch/flux-controlnet-inpaint

Run inpainting with Flux, compatible with Canny ControlNet, LoRAs and HyperFlux_8step

Updated 7 months, 3 weeks ago 17.7K runs

as-himself/artyficial-bongo-0.1

An experimental FLUX-based model for creative research

Updated 7 months, 3 weeks ago 139 runs

pnyompen/sd-controlnet-lora

SD1.5 Canny controlnet with LoRA support.

Updated 7 months, 3 weeks ago 548.6K runs

bytedance/flux-pulid

⚡️FLUX PuLID: FLUX-dev based Pure and Lightning ID Customization via Contrastive Alignment🎭

Updated 7 months, 3 weeks ago 1.8M runs

neurowelt/keros-diffusion

Controlling SD XL diffusion inference

Updated 7 months, 3 weeks ago 10 runs

jschoormans/comfyui-interior-remodel

Interior remodelling, keeps windows, ceilings, and doors. Uses a depth controlnet weighted to ignore existing furniture.

Updated 7 months, 4 weeks ago 21.8K runs

pikachupichu25/live-portrait-image

Match a facial expression to a driving image, using LivePortrait as a base

Updated 7 months, 4 weeks ago 75.8K runs

aodianyun/minicpm-v-26

Updated 7 months, 4 weeks ago 12 runs

aodianyun/minicpm-v-26-int4

Updated 7 months, 4 weeks ago 10 runs

remodela-ai/style-transfer-ii

Updated 7 months, 4 weeks ago 251 runs

hexiaochun/video_merge

Video merging

Updated 8 months ago 1.5K runs

hexiaochun/img2video

Combine input images and audio into a keyframe video

Updated 8 months ago 5.4K runs

hexiaochun/video_uitls

Video conversion toolkit

Updated 8 months ago 7 runs

lucataco/ollama-reflection-70b

Ollama Reflection 70b

Updated 8 months ago 1.6K runs

hexiaochun/minicpm_v26

MiniCPM video understanding

Updated 8 months ago 507 runs

0xroyce/plutus

Fine-tuned version of the LLaMA-3.1-8B model, specifically optimized for tasks in finance, economics, trading, psychology, and social engineering.

Updated 8 months ago 50 runs

nicknaskida/incredibly-fast-whisper

whisper-large-v3, incredibly fast, with speaker diarization, powered by Hugging Face Transformers! 🤗

Updated 8 months ago 198 runs

aodianyun/qwen2-vl-7b

Updated 8 months ago 1.6K runs

aodianyun/qwen2-vl-2b

Updated 8 months ago 126 runs

lucataco/flux-schnell-lora

FLUX.1-Schnell LoRA Explorer

Updated 8 months ago 1.1M runs

xavriley/beat_this

Detect beats in music

Updated 8 months ago 23 runs

fofr/nsfw-model-comparison

Compare NSFW models against inputs

Updated 8 months, 1 week ago 139 runs

pipi32167/minicpm-v-26

Chat with image or video.

Updated 8 months, 1 week ago 529 runs

argildotai/sam2removevideobackground

This project uses the Segment Anything 2 (SAM2) model to remove backgrounds from videos.

Updated 8 months, 1 week ago 1.1K runs

usamaehsan/flux-multicontrolnet

Multi ControlNet Union Pro

Updated 8 months, 1 week ago 78 runs

helios-infotech/sketch_to_image

AI that transforms sketches into realistic images. Upload your drawing and describe it in the prompt. You can also adjust the ControlNet parameters and scale the image to a higher resolution for better results.

Updated 8 months, 1 week ago 2.1K runs

asiryan/kolors

Kolors Model (Text2Img and Img2Img)

Updated 8 months, 1 week ago 16.6K runs

cuuupid/qwen2-vl-2b

SOTA open-source model for chatting with videos and the newest model in the Qwen family

Updated 8 months, 1 week ago 518 runs

victor-upmeet/whisperx-a40-large

Accelerated transcription, word-level timestamps and diarization with whisperX large-v3 for large audio files

Updated 8 months, 1 week ago 163.4K runs

victor-upmeet/whisperx

Accelerated transcription, word-level timestamps and diarization with whisperX large-v3

Updated 8 months, 1 week ago 2.2M runs
