Explore
lucataco / animate-diff
Animate Your Personalized Text-to-Image Diffusion Models

andreasjansson / illusion
Monster Labs' control_v1p_sd15_qrcode_monster ControlNet on top of SD 1.5

sigil-wen / xtts
XTTS: Multilingual Text To Speech Voice Cloning Model by Coqui

meta / llama-2-70b-chat
A 70 billion parameter language model from Meta, fine-tuned for chat completions

stability-ai / sdxl
A text-to-image generative AI model that creates beautiful 1024x1024 images

meta / codellama-13b
A 13 billion parameter Llama tuned for code completion
Collections
Audio generation
Models to generate and modify audio
riffusion/riffusion, meta/musicgen, suno-ai/bark, afiaka87/tortoise-tts, allenhung1025/looptest ...
ControlNet
Control diffusion models
jagilley/controlnet-scribble, jagilley/controlnet-hough, jagilley/controlnet-canny, jagilley/controlnet-depth2img, jagilley/controlnet-hed ...
Diffusion models
Image and video generation models trained with diffusion processes
stability-ai/stable-diffusion, cjwbw/anything-v3-better-vae, cjwbw/anything-v4.0, cjwbw/waifu-diffusion, tommoore515/material_stable_diffusion ...
Embedding models
Models that generate embeddings from inputs
andreasjansson/clip-features, replicate/all-mpnet-base-v2, daanelson/imagebind ...
Image editing
Tools for manipulating images.
tencentarc/gfpgan, sczhou/codeformer, rossjillian/controlnet, cjwbw/rembg, andreasjansson/stable-diffusion-inpainting ...
Image restoration
Models that improve or restore images by deblurring, colorization, and removing noise
tencentarc/gfpgan, jingyunliang/swinir, microsoft/bringing-old-photos-back-to-life, google-research/maxim, cjwbw/bigcolor ...
Image to text
Models that generate text prompts and captions from images
salesforce/blip, andreasjansson/blip-2, methexis-inc/img2prompt, rmokady/clip_prefix_caption, pharmapsychotic/clip-interrogator ...
Language models
Models that can understand and generate text
meta/llama-2-13b-chat, meta/llama-2-70b-chat, meta/llama-2-7b-chat, replicate/vicuna-13b, replicate/dolly-v2-12b ...
ML makeovers
Models that let you change facial features
orpatashnik/styleclip, yuval-alaluf/sam, rinongal/stylegan-nada, mchong6/jojogan, yuval-alaluf/restyle_encoder ...
SDXL fine-tunes
Some of our favorite SDXL fine-tunes.
fofr/sdxl-emoji, fofr/sdxl-barbie, fofr/sdxl-2004, fofr/sdxl-tron, pwntus/sdxl-gta-v ...
Streaming language models
Language models that support streaming responses. See https://replicate.com/docs/streaming
meta/llama-2-13b-chat, meta/llama-2-70b-chat, meta/llama-2-7b-chat, replicate/vicuna-13b, replicate/dolly-v2-12b ...
Style transfer
Models that take a content image and a style reference to produce a new image
huage001/adaattn, paper11667/clipstyler, ptran1203/pytorch-animegan, sanzgiri/cartoonify_video, ariel415el/gpdm ...
Super resolution
Upscaling models that create high-quality images from low-quality images
nightmareai/real-esrgan, jingyunliang/swinir, mv-lab/swin2sr, cjwbw/real-esrgan, cjwbw/rudalle-sr ...
Text to image
Models that generate images from text prompts
stability-ai/stable-diffusion, pixray/text2image, cjwbw/waifu-diffusion, kuprel/min-dalle, laion-ai/erlich ...
Trainable language models
Language models that you can fine-tune using Replicate's training API.
meta/llama-2-13b-chat, meta/llama-2-70b-chat, meta/llama-2-7b-chat, meta/llama-2-7b, meta/llama-2-70b ...
Videos
Models that create and edit videos
deforum/deforum_stable_diffusion, andreasjansson/stable-diffusion-animation, cjwbw/damo-text-to-video, nateraw/stable-diffusion-videos, cjwbw/text2video-zero ...
Popular models
A latent text-to-image diffusion model capable of generating photo-realistic images given any text input
Practical face restoration algorithm for *old photos* or *AI-generated faces*
Stable Diffusion fine-tuned on Midjourney v4 images
Real-ESRGAN with optional face correction and adjustable upscale
Bootstrapping Language-Image Pre-training
Multilingual text2image latent diffusion model
Generate detailed images from scribbled drawings
Latest models
Turn your selfies into a professional headshot in seconds - proHeadshot.pics
📽️ Increase Framerate 🎬 ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation
Generate "algorithmic symphony" one-liners with grammar-constrained CodeLlama
Monster Labs' control_v1p_sd15_qrcode_monster ControlNet on top of SD 1.5
A facial animation generator model based on diffusion
Fine-tuned to generate awesome app icons, by aistartupkit.com
Fast Diffusion for Image Generation, ~3 Seconds
A stylized poster style for text-to-image
Voice cloning with just a 3-second audio clip
Product advertising image generator using SDXL
An SDXL fine-tune based on the cyberpunk style
SDXL fine-tune to generate curvy minimalist shapes and patterns. WIP. Trained on AI-generated images whose prompts used South Indian alphabets as the central motif, to achieve satisfactory rotundness.
XTTS: Multilingual Text To Speech Voice Cloning Model by Coqui
SDXL fine-tuned on the desi truck art style
SDXL 1.0 fine-tuned on images of the dish 'gongbao chicken'
An SDXL fine-tune based on scenic Van Gogh paintings
TOK trained on 1960 coloring books.
Streams previews as the output is generated
SeamlessM4T—Massively Multilingual & Multimodal Machine Translation
A 70 billion parameter language model from Meta, fine-tuned for chat completions
Base version of Llama 2, a 70 billion parameter language model from Meta.
A 13 billion parameter language model from Meta, fine-tuned for chat completions
Base version of Llama 2 13B, a 13 billion parameter language model
A 7 billion parameter language model from Meta, fine-tuned for chat completions
Base version of Llama 2 7B, a 7 billion parameter language model
llama-13b-base fine-tuned on Neuromancer style
SDXL 1.0 fine-tuned on images of 'Bundaberg' bottled beverages
Implementation of SDXL RealVisXL_V1.0