Run Code Llama 70B with an API
Code Llama 70B is one of the powerful open-source code generation models. Learn how to run it in the cloud with one line of code.
January 30, 2024
Fine-tune Llama 2 for English to Hinglish translation with axolotl
Learn how to put together a custom trainer with axolotl to fine-tune a language model for English to Hinglish translation.
January 23, 2024
Clone your voice using open-source models
We’ve added fine-tuning for realistic voice cloning (RVC). You can train RVC on your own dataset from a YouTube video with a few lines of code using Replicate's API.
December 6, 2023
How to create an AI narrator for your life
Or, how I met a virtual David Attenborough.
December 6, 2023
Businesses are building on open-source AI
We've raised a $40 million Series B led by a16z.
December 5, 2023
How to run Yi chat models with an API
The Yi series models are large language models trained from scratch by developers at 01.AI. Learn how to run them in the cloud with one line of code.
November 23, 2023
Scaffold Replicate apps with one command
We've added a CLI command that makes it easy to get started with Replicate.
November 22, 2023
Using open-source models for faster and cheaper text embeddings
An interactive example showing how to embed text using a state-of-the-art embedding model that beats OpenAI's embeddings API on price and performance.
November 10, 2023
Generate music from chord progressions and text prompts with MusicGen-Chord
We’ve added chord conditioning to Meta’s MusicGen model, so you can create automatic backing tracks in any style using text prompts and chord progressions.
November 8, 2023
Generate images in one second on your Mac using a latent consistency model
How to run a latent consistency model on your M1 or M2 Mac
October 25, 2023
How to use retrieval augmented generation with ChromaDB and Mistral
In this post we'll explore the basics of retrieval augmented generation by creating an example app that uses bge-large-en for embeddings, ChromaDB for vector store, and mistral-7b-instruct for language model generation.
October 17, 2023
Fine-tune MusicGen to generate music in any style
We’ve added fine-tuning support to MusicGen. You can train the small, medium and melody models on your own audio files using Replicate.
October 13, 2023
Jet-setting with Llama 2 + Grammars
How to use Llama 2 models with grammars for information extraction tasks.
October 9, 2023
How to run Mistral 7B with an API
Mistral 7B is an open-source large language model. Learn what it's good at and how to run it in the cloud with one line of code.
October 6, 2023
Make smooth AI generated videos with AnimateDiff and an interpolator
Combine AnimateDiff and the ST-MFNet frame interpolator to create smooth and realistic videos from a text prompt
October 4, 2023
Fine-tuned models now boot in less than one second
We've made some dramatic improvements to cold boots for fine-tuned models.
September 6, 2023
Painting with words: a history of text-to-image AI
With the recent release of Stable Diffusion XL fine-tuning on Replicate, and today being the 1-year anniversary of Stable Diffusion, now feels like the perfect opportunity to take a step back and reflect on how text-to-image AI has improved over the last few years.
August 22, 2023
We're cutting our prices in half
The price of public models is being cut in half, and soon we'll start charging new users for setup and idle time on private models.
August 16, 2023
A guide to prompting Llama 2
Learn the art of the Llama prompt.
August 14, 2023
Streaming output for language models
Our API now supports server-sent event streams for language models. Learn how to use them to make your apps more responsive.
August 14, 2023
Fine-tune SDXL with your own images
We’ve added fine-tuning (Dreambooth, Textual Inversion and LoRA) support to SDXL 1.0. You can train SDXL on your own images with one line of code using the Replicate API.
August 8, 2023
What’s the difference between Llama 2 7b, 13b, and 70b?
Let's break down the differences between the Llama 2 models and help you choose the right one for your use case.
August 4, 2023
Run Llama 2 with an API
Llama 2 is the first open source language model of the same caliber as OpenAI’s models. Learn how to run it in the cloud with one line of code.
July 27, 2023
Run SDXL with an API
How to run Stable Diffusion XL 1.0 using the Replicate API
July 26, 2023
A comprehensive guide to running Llama 2 locally
How to run Llama 2 on Mac, Linux, Windows, and your phone.
July 22, 2023
Fine-tune Llama 2 on Replicate
So you want to train a llama...
July 20, 2023
What happened with Llama 2 in the last 24 hours? 🦙
A roundup of recent developments from the llamaverse following the second major release of Meta's open-source large language model.
July 19, 2023
Make any large language model a better poet
Prompt engineering and training are often the first solutions we reach for to improve language model behavior, but they're not the only way.
May 26, 2023
We've added a status page to provide real-time updates on the health of Replicate.
May 18, 2023
Language model roundup, April 2023
A roundup of recent developments from the world of open-source language models.
April 21, 2023
AutoCog — Generate Cog configuration with GPT-4
Give it a machine learning directory and AutoCog will create predict.py and cog.yaml until it successfully runs a prediction
April 19, 2023
Language models are on Replicate
You can now deploy, run, and fine-tune large language models on Replicate.
April 5, 2023
How to use Alpaca-LoRA to fine-tune a model like ChatGPT
Low-rank adaptation (LoRA) is a technique for fine-tuning models that has some advantages over previous methods:
March 23, 2023
Week 3 of LLaMA 🦙
A roundup of recent developments from the llamaverse.
March 18, 2023
Fine-tune LLaMA to speak like Homer Simpson
With a small amount of data and an hour of training you can make LLaMA output text in the voice of the dataset.
March 17, 2023
Train and run Stanford Alpaca on your own machine
We'll show you how to train Alpaca, a fine-tuned version of LLaMA that can respond to instructions like ChatGPT.
March 16, 2023
Machine learning needs better tools
Lots of people want to build things with machine learning, but they don't have the expertise to use it.
February 21, 2023
Introducing LoRA: A faster way to fine-tune Stable Diffusion
It's like DreamBooth, but much faster. And you can run it in the cloud on Replicate.
February 7, 2023
Train and deploy a DreamBooth model on Replicate
With just a handful of images and a single API call, you can train a model, publish it to Replicate, and run predictions on it in the cloud.
November 21, 2022
Run Stable Diffusion on your M1 Mac’s GPU
How to run Stable Diffusion locally so you can hack on it
August 31, 2022
Run Stable Diffusion with an API
How to use Replicate to integrate Stable Diffusion into hacks, apps, and projects
August 29, 2022
Build a robot artist for your Discord server with Stable Diffusion, Replicate, and Fly.io
A tutorial for building a chat bot that replies to prompts with the output of a text-to-image model.
August 25, 2022
Join us at Uncanny Spaces
We're bringing people together to explore what's being created with machine learning.
August 11, 2022
Automating image collection
Using CLIP and LAION5B to collect thousands of captioned images.
August 5, 2022
Illustrating the news with AI
Creating a web app to illustrate news headlines with AI-generated visualizations
July 28, 2022
Exploring text to image models
The basics of using the API to create your own images from text.
July 18, 2022
A new template for model READMEs
Inspired by model cards, we've created templates for documenting models on Replicate.
July 5, 2022
An introduction to differentiable programming and the process of refining generative art models.
May 27, 2022
We're a small team of engineers and machine learning enthusiasts working to make machine learning more accessible.
May 16, 2022