Explore

I want to…

Use official models

Official models are always on, maintained, and have predictable pricing.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

🎨AnimateDiff Prompt Travel🧭 Seamlessly Navigate and Animate Between Text-to-Image Prompts for Dynamic Visual Narratives

Updated 5.6K runs

A simple OCR Model that can easily extract text from an image.

Updated 89.8M runs

Updated 345 runs

Video object segmentation for short and long videos

Updated 33 runs

Open diffusion model for high-quality video generation

Updated 10.3K runs

Synthesizing High-Resolution Images with Few-Step Inference

Updated 1.1M runs

Batch mode for text & image embeddings

Updated 68 runs

Tuning-free Higher-Resolution Visual Generation with Diffusion Models

Updated 1.1K runs

Generate videos from text prompts with Kandinsky-2.2

Updated 7.3K runs

SAM(Segment Anything) ViT-H image encoder

Updated 357.9K runs

Whisper-Large-V2 + Pyannote 3.0 diarization via WhisperX

Updated 95 runs

Anime Pastel Dream Model For Splurge Art

Updated 5.4K runs

Neurogen Model for Splurge Art

Updated 3.9K runs

Dreamlike Anime 1.0 for Splurge Art

Updated 6K runs

Dreamlike Photoreal Model for Splurge Art

Updated 3.2K runs

Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Updated 978 runs

TheBloke/Nous-Hermes-Llama2-AWQ served with vLLM

Updated 7.2K runs

Gender recognition for audio files

Updated 3.8K runs

cog-resnet example trial

Updated 8 runs

Make a transcription of a phone call

Updated 10 runs

Trained on plants

Updated 28 runs

My own personal copy of daanelson/whisperx

Updated 311 runs

A multimodal LLM-based AI assistant, which is trained with alignment techniques. Qwen-VL-Chat supports more flexible interaction, such as multi-round question answering, and creative capabilities.

Updated 796.9K runs

Mistral-7B-v0.1 fine tuned for chat with the OpenOrca dataset.

Updated 65.9K runs

Using a ComfyUI workflow to run SDXL text2img

Updated 442 runs

Zero-shot / open vocabulary object detection

Updated 23.5K runs

A high-performing language model trained to act as a helpful assistant

Updated 8K runs

Controlling Vision-Language Models for Universal Image Restoration

Updated 2.1K runs

✨DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Updated 132K runs

Updated 105 runs

Object removal, video completion and video outpainting

Updated 1.8K runs

Updated 22 runs

Updated 672 runs

Instruction tuned text-to-image diffusion models as vision generalists

Updated 356 runs

📽️ Increase Framerate 🎬 ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation

Updated 51.4K runs

Qwen-14B-Chat is a Transformer-based large language model, which is pretrained on a large volume of data, including web texts, books, codes, etc.

Updated 5.4K runs

Embedding models that has been trained using Jina AI's Linnaeus-Clean dataset.

Updated 34 runs

Updated 206 runs

Stylized Audio-Driven Single Image Talking Face Animation

Updated 18.7K runs

Updated 53 runs

Text-to-gif using SDXL, with controlnet and lora support

Updated 3.7K runs

Hotshot XL using SDXL for generating one second clips of high quality! Running on a40 Made by the greats at hotshot.co and brought to you by your friends at FullJourney! Thanks to LucaTaco for the MVP!

Updated 4.4K runs

🎨 AnimateDiff (w/ MotionLoRAs for Panning, Zooming, etc): Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

Updated 55.4K runs

Image restoration and face enhancement

Updated 18.5K runs

A ControlNet model designed to enhance the temporal consistency of generated outputs

Updated 128 runs

Updated 564 runs

BAAI's bge-en-large-v1.5 for embedding text sequences

Updated 294K runs

Updated 75 runs

Inst-Inpaint: Instructing to Remove Objects with Diffusion Models

Updated 514 runs

SDXL ControlNet - Canny

Updated 2.2M runs