3D models

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

alaradirik/dreamgaussian, jd7h/zero123plusplus, adirik/wonder3d, adirik/mvdream, adirik/mvdream-multi-view ...

Streaming language models

Language models that support streaming responses. See https://replicate.com/docs/streaming

meta/llama-2-70b-chat, meta/llama-2-13b-chat, meta/llama-2-7b-chat, yorickvp/llava-13b, fofr/prompt-classifier ...
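
As a rough illustration of what consuming a streamed response can look like, here is a minimal sketch that assumes the stream arrives as server-sent events (SSE) with "output" events carrying text chunks and a "done" event marking the end. The event names, the SAMPLE payload and the helper names parse_sse and collect_output are assumptions made for this example, not taken from this page; the streaming docs linked above are the authority on the real format.

```python
from typing import Iterable, Iterator, List, Tuple


def parse_sse(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (event, data) pairs from an iterable of SSE text lines."""
    event: str = "message"  # default event name when none is given
    data: List[str] = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line terminates the current event.
            if data:
                yield event, "\n".join(data)
            event, data = "message", []
        elif line.startswith(":"):
            continue  # comment or keep-alive line
        elif line.startswith("event:"):
            event = line[len("event:"):].strip()
        elif line.startswith("data:"):
            value = line[len("data:"):]
            # Only a single leading space after the colon is dropped.
            data.append(value[1:] if value.startswith(" ") else value)


def collect_output(lines: Iterable[str]) -> str:
    """Join the text of "output" events until a "done" event arrives."""
    parts: List[str] = []
    for event, data in parse_sse(lines):
        if event == "done":  # assumed end-of-stream marker
            break
        if event == "output":  # assumed name for text chunks
            parts.append(data)
    return "".join(parts)


# Hypothetical stream contents, as they might arrive line by line.
SAMPLE = [
    "event: output\n",
    "data: Hello\n",
    "\n",
    ": keep-alive\n",
    "event: output\n",
    "data:  world\n",
    "\n",
    "event: done\n",
    "data: {}\n",
    "\n",
]
```

In practice the lines would come from an HTTP response read incrementally rather than from a list, and each chunk could be displayed as soon as it is parsed instead of being collected first.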

Super resolution

Upscaling models that create high-quality images from low-quality images

nightmareai/real-esrgan, jingyunliang/swinir, mv-lab/swin2sr, cjwbw/real-esrgan, cjwbw/rudalle-sr ...

Vision models

Multimodal large language models with vision capabilities like object detection and optical character recognition (OCR)

yorickvp/llava-13b, daanelson/minigpt-4, cjwbw/internlm-xcomposer, joehoover/mplug-owl, lucataco/qwen-vl-chat ...

Latest models

Playground v2 is a diffusion-based text-to-image generative model trained from scratch.

Chest X-ray

Create driving poses for magic-animate

BSHM portrait matting

A fine-tuned SDXL based on GTA V art

Kosmos-G: Generating Images in Context with Multimodal Large Language Models

SVD + in-painting

A ~7B parameter language model from DeepSeek for SOTA repository-level code completion

Source: Pclanglais/MonadGPT ✦ Quant: TheBloke/MonadGPT-AWQ ✦ What would have happened if ChatGPT was invented in the 17th century?

Playground v2 is a diffusion-based text-to-image generative model trained from scratch by the research team at Playground

Source: fblgit/una-cybertron-7b-v2-bf16 ✦ Quant: TheBloke/una-cybertron-7B-v2-AWQ ✦ A 7B MistralAI-based model, best in its series. Trained with SFT, DPO and UNA (Unified Neural Alignment) on multiple datasets

SDXL inference with Cog, with support for multiple models in one instance.

Generate plasma shader equations

Translate audio while keeping its original style, pronunciation and tone.

Convert your videos to DensePose and use them with MagicAnimate

Add a watermark to your videos using the power of Replicate brought to you from your friends at FullJourney.AI

GoogleAI: Style Aligned Image Generation via Shared Attention

Extracts motion from video

Remix music into other styles with MusicGen Chord

https://www.photoaistudio.com. Take a picture of your face and instantly get any profile picture you want. Only 1 photo, no training needed.

Train your own custom RVC model

A fine-tuned SDXL LoRA trained on Georgia O'Keeffe art

https://www.interioraidesigns.com. Take a picture of your room and see how your room looks in different themes. Remodel your room today.

Transcribes any audio file with speaker diarization. *Please check the README*

Deliberate V5 Model (Text2Img, Img2Img and Inpainting)

Counterfeit XL v2 Model (Text2Img, Img2Img and Inpainting)

Edit real or generated images

Edit real or generated images

Simple model that performs addition and sends the answer to Supabase

Highest resolution image

Juggernaut XL v7 Model (Text2Img, Img2Img and Inpainting)

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Generate color codes for prominent colors in the image

Given two images depicting a source structure and a target appearance, generate an image merging the structure of one image with the appearance of the other

Source: chargoddard/loyal-piano-m7 ✦ Quant: TheBloke/loyal-piano-m7-AWQ ✦ Intended to be a roleplay-focused model with some smarts and good long-context recall

A fine-tuned SDXL LoRA trained on images of stealth planes

Notus-7b-v1 model

Real-ESRGAN Upscale with AI Face Correction

