Explore
zsxkib/pulid
📖 PuLID: Pure and Lightning ID Customization via Contrastive Alignment
meta/meta-llama-guard-2-8b
A llama-3 based moderation and safeguarding language model
cjwbw/openvoice
Updated to OpenVoice v2: Versatile Instant Voice Cloning
fofr/video-morpher
Generate a video that morphs between subjects, with an optional style
snowflake/snowflake-arctic-instruct
An efficient, intelligent, and truly open-source language model
meta/meta-llama-3-70b-instruct
A 70 billion parameter language model from Meta, fine tuned for chat completions
I want to…
Generate images
Models that generate images from text prompts
Edit images
Tools for manipulating images.
Caption images
Models that generate text from images
Restore images
Models that improve or restore images by deblurring, colorization, and removing noise
Use a language model
Models that can understand and generate text
Upscale images
Upscaling models that create high-quality images from low-quality images
Get embeddings
Models that generate embeddings from inputs
Extract text from images
Optical character recognition (OCR) and text extraction
Train a language model
Language models that you can fine-tune using Replicate's training API.
Use a face to make images
Make realistic images of people instantly
Chat with images
Ask language models about images
Transcribe speech
Models that convert speech to text
Use handy tools
Toolbelt-type models for videos and images.
Generate music
Models to generate and modify music
Generate videos
Models that create and edit videos
Generate speech
Convert text to speech
Make 3D stuff
Models that generate 3D objects, scenes, radiance fields, textures and multi-views.
Get structured data
Language models that support grammar-based decoding as well as jsonschema constraints.
Popular models
SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps
Real-ESRGAN with optional face correction and adjustable upscale
A text-to-image generative AI model that creates beautiful images
Visual instruction tuning towards large language and vision models with GPT-4 level capabilities
Practical face restoration algorithm for *old photos* or *AI-generated faces*
Practical face restoration algorithm for *old photos* or *AI-generated faces*
Latest models
InstantID. ControlNets. More base SDXL models. And the latest ByteDance's ⚡️SDXL-Lightning !⚡️
The img2img pipeline that makes an anime-style image of a person. It uses one of sd1.5 models as a base, depth-estimation as a ControleNet and IPadapter model for face consistency.
GPU accelerated replay renderer / video data clipper for comma.ai connect's openpilot route data. SEE README.
Consistent Self-Attention for Long-Range Image and Video Generation
Robust face restoration algorithm for old photos / AI-generated faces (adapted to work with video inputs)
A SDXL Model trained from another SDXL-hiroshinagai model images
Train on RealVisXL 4.0 (Realistic Vision XL 4) | Mixed precision bf16 any LoRA
Just some good ole beautifulsoup scrapping URL magic. (some sites don't work as they block scrapping, but still useful)
Projection module trained to add vision capabilties to Llama 3 using SigLIP
llava-phi-3-mini is a LLaVA model fine-tuned from microsoft/Phi-3-mini-4k-instruct
PyTorch implementation of AnimeGAN for fast photo animation
Function calling with llama-3 with prompting only.
Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data
Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis
AbsoluteReality V1.8.1 Model (Text2Img, Img2Img and Inpainting)
Phi-3-Mini-128K-Instruct is a 3.8 billion-parameter, lightweight, state-of-the-art open model trained using the Phi-3 datasets
Phi-3-Mini-4K-Instruct is a 3.8B parameters, lightweight, state-of-the-art open model trained with the Phi-3 datasets
This is wizard-vicuna-13b trained with a subset of the dataset - responses that contained alignment / moralizing were removed
Newest reranker model from BAAI (https://huggingface.co/BAAI/bge-reranker-v2-m3). FP16 inference enabled. Normalize param available
Best-in-class clothing virtual try on in the wild (non-commercial use only)