Explore
iordcalin/material-transfer
Transfer a material from an image to a subject
cjwbw/openvoice
Updated to OpenVoice v2: Versatile Instant Voice Cloning
snowflake/snowflake-arctic-instruct
An efficient, intelligent, and truly open-source language model
meta/meta-llama-3-70b-instruct
A 70 billion parameter language model from Meta, fine tuned for chat completions
meta/meta-llama-3-8b-instruct
An 8 billion parameter language model from Meta, fine tuned for chat completions
vaibhavs10/incredibly-fast-whisper
whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗
I want to…
Generate images
Models that generate images from text prompts
Edit images
Tools for manipulating images.
Restore images
Models that improve or restore images by deblurring, colorization, and removing noise
Caption images
Models that generate text from images
Get embeddings
Models that generate embeddings from inputs
Upscale images
Upscaling models that create high-quality images from low-quality images
Use a language model
Models that can understand and generate text
Extract text from images
Optical character recognition (OCR) and text extraction
Train a language model
Language models that you can fine-tune using Replicate's training API.
Use a face to make images
Make realistic images of people instantly
Chat with images
Ask language models about images
Use handy tools
Toolbelt-type models for videos and images.
Transcribe speech
Models that convert speech to text
Generate music
Models to generate and modify music
Generate videos
Models that create and edit videos
Generate speech
Convert text to speech
Make 3D stuff
Models that generate 3D objects, scenes, radiance fields, textures and multi-views.
Get structured data
Language models that support grammar-based decoding as well as jsonschema constraints.
Latest models
A SDXL Model trained from another SDXL-hiroshinagai model images
Train SDXL 1.0 with LoRA | mixed precision bf16 and save precision fp16
Just some good ole beautifulsoup scrapping URL magic. (some sites don't work as they block scrapping, but still useful)
High resolution image Upscaler and Enhancer. Use at ClarityAI.cc. A free Magnific alternative. Twitter/X: @philz1337x
Tango2: LLM-guided Diffusion-based Text-to-Audio Generation and DPO-based Alignment
Run any ComfyUI workflow. Guide: https://github.com/fofr/cog-comfyui
Demucs is an audio source separator created by Facebook Research.
Projection module trained to add vision capabilties to Llama 3 using SigLIP
Dark Sushi Mix 2.25D Model with vae-ft-mse-840000-ema (Text2Img, Img2Img and Inpainting)
llava-phi-3-mini is a LLaVA model fine-tuned from microsoft/Phi-3-mini-4k-instruct
PyTorch implementation of AnimeGAN for fast photo animation
Function calling with llama-3 with prompting only.
Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data
Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis
AbsoluteReality V1.8.1 Model (Text2Img, Img2Img and Inpainting)
Phi-3-Mini-128K-Instruct is a 3.8 billion-parameter, lightweight, state-of-the-art open model trained using the Phi-3 datasets
Phi-3-Mini-4K-Instruct is a 3.8B parameters, lightweight, state-of-the-art open model trained with the Phi-3 datasets
This is wizard-vicuna-13b trained with a subset of the dataset - responses that contained alignment / moralizing were removed
Newest reranker model from BAAI (https://huggingface.co/BAAI/bge-reranker-v2-m3). FP16 inference enabled. Normalize param available
Best-in-class clothing virtual try on in the wild (non-commercial use only)
Generate a video that morphs between subjects, with an optional style
An efficient, intelligent, and truly open-source language model
Make stickers with AI. Generates graphics with transparent backgrounds.