

beautyyuyanli/multilingual-e5-large
multilingual-e5-large: A multi-language text embedding model
33.8M runs


prunaai/flux-fast
This is the fastest Flux endpoint in the world.
33.8M runs


jaaari/kokoro-82m
Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)
60.6M runs


andreasjansson/clip-features
Return CLIP features for the clip-vit-large-patch14 model
121.8M runs

google/gemini-3-pro
Google's most advanced reasoning Gemini model
2.8K runs


openai/gpt-5.1
The best model for coding and agentic tasks with configurable reasoning effort.
35.9K runs

google/nano-banana-pro
Google's state of the art image generation and editing model 🍌🍌
174.4K runs

qwen/qwen-image-edit-plus-lora
Qwen Image Edit 2509 LoRA explorer, uses HuggingFace URLs to load any safetensor
16K runs

retro-diffusion/rd-plus
High quality and authentic pixel art image generation
1.4K runs


zsxkib/seedvr2
🔥 SeedVR2: one-step video & image restoration with 3B/7B hot‑swap and optional color fix 🎬✨
4.5K runs
minimax/hailuo-2.3
A high-fidelity video generation model optimized for realistic human motion, cinematic VFX, expressive characters, and strong prompt and style adherence across both text-to-video and image-to-video workflows
10.1K runs

lightricks/ltx-2-fast
Ideal for rapid ideation and mobile workflows. Perfect for creators who need instant feedback, real-time previews, or high-throughput content.
21.1K runs
google/veo-3.1
New and improved version of Veo 3, with higher-fidelity video, context-aware audio, reference image and last frame support
86K runs


openai/sora-2
OpenAI's Flagship video generation with synced audio
112.5K runs


bytedance/seedream-4
Unified text-to-image generation and precise single-sentence editing at up to 4K resolution
13.5M runs


philz1337x/crystal-upscaler
High-precision image upscaler optimized for portraits, faces and products. One of the upscale modes powered by Clarity AI. X:https://x.com/philz1337x
189.4K runs
Official models are always on, maintained, and have predictable pricing.

Google's most advanced reasoning Gemini model

Generate complex 3D models from images with Rodin Gen-2

The best model for coding and agentic tasks with configurable reasoning effort.

Fusion – Product/object blending that fixes perspective and lighting so the subject melts into a new background via the Fusion LoRA.

Relight – Soft, curtain-filtered relighting that repaints the scene with golden-hour or moody tones using the Relight LoRA.

Upscale – Detail-loving upscale/restore pass that sharpens textures and color fidelity with the Upscale LoRA.

Next Scene – “Next beat” cinematic edits that keep subject identity while steering to the next camera move via the Next Scene LoRA

Skin – Natural beauty retouch that enhances pores and tonal variation (no plastic skin) via the Skin LoRA.

Photo to Anime – Stylized conversion that turns photos into crisp cel-shaded anime frames using the Photo-to-Anime LoRA.

an open-source, 2B-parameter model built for real-world applications

Kimi K2 Thinking is the latest, most capable version of an open-source thinking model.

Google's state of the art image generation and editing model 🍌🍌

Qwen Image Edit 2509 LoRA explorer, uses HuggingFace URLs to load any safetensor

Reve's fast image edit model at only $0.01 per edit

Camera-aware edits for Qwen/Qwen-Image-Edit-2509 with Lightning + multi-angle LoRA

Style consistent animated pixel art sprite generation

All the tools you need for generating pixel art tilesets

High quality and authentic pixel art image generation

Fast pixel art image generation

MiniMax Speech 2.6 HD delivers studio-quality multilingual text-to-audio on Replicate with nuanced prosody, subtitle export, and premium voices
Use AI to generate images & photos with an API
Use AI to caption videos with an API
Use AI for text-to-speech or to clone your voice via API
Use AI to generate images from a face with an API
Use AI to generate videos with an API
Use AI to upscale images with super resolution with an API
Use AI to generate music with an API
Use AI to edit any image via API
Use AI to transcribe speech to text via API
Use AI For Optical Character Recognition (OCR) to extract text from images via API
Use AI to remove backgrounds from images and videos with an API
FLUX AI models: advanced image generation & editing via API
Use AI to restore images via API
Use AI to enhance videos via API - Replicate
Detect NSFW content in images and text
Classify text by sentiment, topic, intent, or safety
Identify speakers from audio and video inputs
Replace faces across images with natural-looking results.
Transform rough sketches into polished visuals
Generate custom emojis from text or images
Create anime-style characters, scenes, and animations
Use AI to create 3D content with an API
Chat with images for understanding, captioning & detection via API
Use AI to Generate Videos from Images with API
Explore Large Language Models (LLMs) for chat, generation & NLP tasks via API
Try AI Models for free: video generation, image generation, upscaling, and photo restoration
Use AI to control image generation with an API
Embedding models for AI search and analysis
Use AI to edit your videos with an API
Use AI object detection and segmentation models to distinguish objects in images & videos
Use AI to generate lipsync videos with an API
Official AI models: Always available, stable, and predictably priced
Flux fine-tunes: build and run custom AI image models via API
Kontext fine-tunes: Build custom AI image models with an API
Create songs with voice cloning models via API
AI media utilities: auto-caption, watermark, frame extraction & more via API
Browse the diverse range of qwen-image fine-tunes the community has custom-trained on Replicate.
WAN family of models: powerful image-to-video & text-to-video models
Use AI To Caption Images with an API


press1209/80s-1.0
30 runs


andreasjansson/pixel-art-downscaler
Simple algorithm to downscale a pixel art with slightly irregular grid to the smallest non-pixelated image
2 runs

playmore/speech-enhancer
Speech Enhancer
113 runs


clipnpaper/alcohol_cozy_bar
wine bar background
6 runs

playmore/resemble-denoiser
Denoiser from Resemble
40 runs

google/gemini-3-pro
Google's most advanced reasoning Gemini model
2.8K runs

subformer/video-dubbing
Translate audios and videos into 100+ languages with natural speech, voice cloning, and accurate timing. This is a DEMO version. Visit subformer.com for the full verison.
43 runs

hjunior29/video-text-remover
Clean videos by automatically removing text overlays
37 runs

hyper3d/rodin
Generate complex 3D models from images with Rodin Gen-2
174 runs

zeke/cloudflare-goo
Generate Replicate goo™ in the Cloudflare style 🧡
45 runs


lindstrom-ux/nanaiavatar1
9 runs


rolin44342/angelot45
18 runs