Run a model from Node.js

Run a model from Google Colab

Run a model from Python

Fine-tune an image model

How does Replicate work?

Client libraries

Documentation

Replicate lets you run AI models with a cloud API, without having to understand machine learning or manage your own infrastructure.

You can run open-source models that other people have published, bring your own training data to create fine-tuned models , or build and publish custom models from scratch.

Get started

Try out Replicate quickly in your environment of choice

Run a model from Node.js

Run a model from Node.js

Run a model from Node.js

Run a model from Google Colab

Run a model from Google Colab

Run a model from Google Colab

Run a model from Python

Run a model from Python

Run a model from Python

Fine-tune an image model

Fine-tune an image model

Fine-tune an image model

Topics

Learn about the building blocks that make Replicate work

Models

Predictions

Organizations

Billing

Site policy

Guides

Ready to build something? Check out these guides to level up your AI skills

Make art with Stable Diffusion

Make art with Stable Diffusion

Make art with Stable Diffusion

Build a website with Next.js

Build a website with Next.js

Build a website with Next.js

Build a Discord bot with Python

Build a Discord bot with Python

Build a Discord bot with Python

Build an app with SwiftUI

Build an app with SwiftUI

Build an app with SwiftUI

Push your own model

Push your own model

Push your own model

Push a Diffusers model

Push a Diffusers model

Push a Diffusers model

Get a GPU on Brev

Get a GPU on Brev

Get a GPU on Brev

Working with LoRAs

Working with LoRAs

Working with LoRAs

ComfyUI

ComfyUI

Videos

Prefer to learn by watching videos? Check out some recent demos from our YouTube channel

Thumbnail for Run Replicate models using Cloudflare Workers

Run Replicate models using Cloudflare Workers

Create and deploy a web app with a serverless backend and a React frontend in under 60 seconds.

7 minutes

Thumbnail for Create stylized videos using pre-trained HuggingFace LoRAs

Create stylized videos using pre-trained HuggingFace LoRAs

Make video content using the Hunyuan model with pre-trained styles from HuggingFace, or using your own images as training data.

3 minutes

Thumbnail for FLUX.1 Schnell vs FLUX.1 Dev

FLUX.1 Schnell vs FLUX.1 Dev

Explore the differences between Flux Schnell and Flux Dev image generation models and learn how to enhance image quality effectively.

6 minutes

Thumbnail for David Attenborough is now narrating my life

David Attenborough is now narrating my life

Here's a GPT-4-vision + ElevenLabs python script so you can star in your own Planet Earth.

2 minutes

Thumbnail for Write your shell commands in English

Write your shell commands in English

Use language models like GPT-4o and Llama to write one-liner shell commands, then execute them.

4 minutes

Thumbnail for Introducing create-replicate-app

Introducing create-replicate-app

A quick and easy way to run Replicate models with Node.js

3 minutes

Thumbnail for Using webhooks with Replicate's API

Using webhooks with Replicate's API

Learn how to receive webhooks from Replicate's API when running predictions and trainings.

14 minutes

View more videos on YouTube