Explore

I want to…

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Flux fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Latest models

Generate clay style images based on prompts or images

Updated 369 runs

GLM-4V is a multimodal model released by Tsinghua University that is competitive with GPT-4o and establishes a new SOTA on several benchmarks, including OCR.

Updated 65.9K runs

Convert speech in audio to text w/ `tiny`, `small`, `base`, and `large-v3` models

Updated 60 runs

Dolphin-2.9 has a variety of instruction, conversational, and coding skills. It also has initial agentic abilities and supports function calling

Updated 244 runs

Extended video synthesis model that generates 128 frames

Updated 197 runs

Image generation, Inpaint Strength, loras custom_urls and enhancer.

Updated 403 runs

Updated 33 runs

Updated 55 runs

Updated 19 runs

Hermes 2 Pro is an updated and cleaned version of the OpenHermes 2.5 Dataset, as well as a newly introduced Function Calling and JSON Mode dataset developed in-house

Updated 140 runs

Best Open-Source Model for Function Calling

Updated 25 runs

Updated 31 runs

Hermes-2 Θ (Theta) is the first experimental merged model released by Nous Research

Updated 15 runs

Hermes 2 Pro is an updated and cleaned version of the OpenHermes 2.5 Dataset, as well as a newly introduced Function Calling and JSON Mode dataset developed in-house

Updated 1.3K runs

Speech to speech with any RVC v2 trained AI voice

Updated 164K runs

hello world

Updated 44 runs

Google's Gemma2 27b instruct model

Updated 1.8K runs

AuraSR: GAN-based Super-Resolution for real-world

Updated 1.7K runs

Google's Gemma2 9b instruct model

Updated 3.6K runs

Model

Updated 409 runs

A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Updated 761 runs

Model that generates Cartoon like characters

Updated 738 runs

Stable Diffusion 3 with Differential Diffusion inpainting (experimental)

Updated 263 runs

Fork of https://replicate.com/schananas/grounded_sam that uses OwlV2 instead of Grounding Dino

Updated 544 runs

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Updated 33.5K runs

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Updated 27.9K runs

Updated 461 runs

Qwen 2: A 7 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

Updated 1.6K runs

Qwen 2: A 1.5 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

Updated 201 runs

A novel speech model for insane prosody.

Updated 280 runs

Qwen 2: A 0.5 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

Updated 179 runs

good for video teaser backsound

Updated 52 runs

High resolution image Upscaler and Enhancer. Use at ClarityAI.co. A free Magnific alternative. Twitter/X: @philz1337x

Updated 6.3M runs

Updated 505 runs

✨Stable Diffusion 3 w/ ⚡InstantX's Canny, Pose, and Tile ControlNets🖼️

Updated 996 runs

A model for experimenting with all the SD3 settings. Non-commercial use only, unless you have a Stability AI Self Hosted License.

Updated 31.8K runs

Updated 168 runs

Stable Audio Open is an open-source model optimized for generating short audio samples, sound effects, and production elements using text prompts.

Updated 3.9K runs

Updated 80 runs

Stable Diffusion 3 medium with added variability in outputs. Non-commercial use only, unless you have a Stability AI Self Hosted License.

Updated 20.2K runs

Transcribe saxophone solos directly from audio

Updated 149 runs

Updated 246 runs

Real-Time Open-Vocabulary Object Detection using the xl weights

Updated 316 runs

MusicGen running on an a40 with 60 seconds max duration

Updated 393 runs

Updated 169 runs

DOVER video quality assessment tool, assigning videos both aesthetic and technical quality scores

Updated 25 runs

Generate Product photography backgrounds using Stable Diffusion

Updated 393 runs

DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation. Hologram optimized

Updated 303 runs

Transfer learning models for music classification by genres, moods, and instrumentation

Updated 9.7K runs

🫦 Realistic facial expression manipulation (lip-syncing) using audio or video

Updated 744 runs