Explore

stability-ai / sdxl
A text-to-image generative AI model that creates beautiful 1024x1024 images

meta / codellama-13b
A 13 billion parameter Llama tuned for code completion

lucataco / animate-diff
Animate Your Personalized Text-to-Image Diffusion Models

andreasjansson / illusion
Monster Labs' control_v1p_sd15_qrcode_monster ControlNet on top of SD 1.5

sigil-wen / xtts
XTTS: Multilingual Text To Speech Voice Cloning Model by Coqui

meta / llama-2-70b-chat
A 70 billion parameter language model from Meta, fine-tuned for chat completions
Collections
Audio generation
Models to generate and modify audio
riffusion/riffusion , meta/musicgen , suno-ai/bark , afiaka87/tortoise-tts , allenhung1025/looptest ...
ControlNet
Control diffusion models
jagilley/controlnet-scribble , jagilley/controlnet-hough , jagilley/controlnet-canny , jagilley/controlnet-depth2img , jagilley/controlnet-hed ...
Diffusion models
Image and video generation models trained with diffusion processes
stability-ai/stable-diffusion , cjwbw/anything-v3-better-vae , cjwbw/anything-v4.0 , cjwbw/waifu-diffusion , tommoore515/material_stable_diffusion ...
Embedding models
Models that generate embeddings from inputs
andreasjansson/clip-features , replicate/all-mpnet-base-v2 , daanelson/imagebind ...
Image editing
Tools for manipulating images
tencentarc/gfpgan , sczhou/codeformer , rossjillian/controlnet , cjwbw/rembg , andreasjansson/stable-diffusion-inpainting ...
Image restoration
Models that improve or restore images by deblurring, colorization, and removing noise
tencentarc/gfpgan , jingyunliang/swinir , microsoft/bringing-old-photos-back-to-life , google-research/maxim , cjwbw/bigcolor ...
Image to text
Models that generate text prompts and captions from images
salesforce/blip , andreasjansson/blip-2 , methexis-inc/img2prompt , rmokady/clip_prefix_caption , pharmapsychotic/clip-interrogator ...
Language models
Models that can understand and generate text
meta/llama-2-13b-chat , meta/llama-2-70b-chat , meta/llama-2-7b-chat , replicate/dolly-v2-12b , replicate/vicuna-13b ...
ML makeovers
Models that let you change facial features
orpatashnik/styleclip , yuval-alaluf/sam , rinongal/stylegan-nada , mchong6/jojogan , yuval-alaluf/restyle_encoder ...
SDXL fine-tunes
Some of our favorite SDXL fine-tunes
fofr/sdxl-emoji , fofr/sdxl-barbie , fofr/sdxl-2004 , fofr/sdxl-tron , pwntus/sdxl-gta-v ...
Streaming language models
Language models that support streaming responses. See https://replicate.com/docs/streaming
meta/llama-2-13b-chat , meta/llama-2-70b-chat , meta/llama-2-7b-chat , replicate/dolly-v2-12b , replicate/vicuna-13b ...
Style transfer
Models that take a content image and a style reference to produce a new image
huage001/adaattn , paper11667/clipstyler , ptran1203/pytorch-animegan , sanzgiri/cartoonify_video , ariel415el/gpdm ...
Super resolution
Upscaling models that create high-quality images from low-quality images
nightmareai/real-esrgan , jingyunliang/swinir , mv-lab/swin2sr , cjwbw/real-esrgan , cjwbw/rudalle-sr ...
Text to image
Models that generate images from text prompts
stability-ai/stable-diffusion , pixray/text2image , cjwbw/waifu-diffusion , kuprel/min-dalle , laion-ai/erlich ...
Trainable language models
Language models that you can fine-tune using Replicate's training API
meta/llama-2-13b-chat , meta/llama-2-70b-chat , meta/llama-2-7b-chat , meta/llama-2-7b , meta/llama-2-70b ...
Videos
Models that create and edit videos
deforum/deforum_stable_diffusion , andreasjansson/stable-diffusion-animation , cjwbw/damo-text-to-video , nateraw/stable-diffusion-videos , cjwbw/text2video-zero ...
Popular models
A latent text-to-image diffusion model capable of generating photo-realistic images given any text input
Stable Diffusion fine-tuned on Midjourney v4 images
Practical face restoration algorithm for *old photos* or *AI-generated faces*
Real-ESRGAN with optional face correction and adjustable upscale
Multilingual text-to-image latent diffusion model
A 13 billion parameter language model from Meta, fine-tuned for chat completions
Latest models
A text-to-image generative AI model that creates beautiful 1024x1024 images
BrawnyAi's testing new SDXL training capabilities
🎨 AnimateDiff (w/ MotionLoRAs for Panning, Zooming, etc): Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
(Alpha) GPU accelerated replay renderer for comma.ai connect's openpilot route data
Advanced text-image comprehension and composition based on InternLM
Turn your selfies into a professional headshot in seconds - proHeadshot.pics
qr2ai: AI-generated QR codes with Stable Diffusion
WIP - Trained on Bing Dalle (2.5) generations with a glitch collage aesthetic
A fine-tuned version of SDXL 1.0 that replicates Midjourney-like images
Segment Anything Model with point prompt.
Transcribes any audio file (file, base64 or url) with speaker diarization. *Please read instructions below*
Take a video and replace the face in it with a face of your choice. You only need one image of the desired face. No dataset, no training.
Nougat: Neural Optical Understanding for Academic Documents
Batch inference for DreamBooth trainings, including OpenPose
SDXL fine-tuned on minimalist design by imageapp.xyz
A 7 billion parameter language model from Mistral.
An instruction-tuned 7 billion parameter language model from Mistral
A 34 billion parameter Llama tuned for coding and conversation
A 13 billion parameter Llama tuned for coding and conversation
A 7 billion parameter Llama tuned for coding and conversation
A 34 billion parameter Llama tuned for coding with Python
A 13 billion parameter Llama tuned for coding with Python
A 7 billion parameter Llama tuned for coding with Python
A 34 billion parameter Llama tuned for coding and conversation
A 13 billion parameter Llama tuned for code completion
A 7 billion parameter Llama tuned for coding and conversation
Modify images using depth maps
Modify images using sketches
Modify images using canny edges
Modify images using line art
Modify images using human pose
not as good as https://replicate.com/wolverinn/realistic-background