andreasjansson/clip-features
Return CLIP features for the clip-vit-large-patch14 model
106.6M runs
openai/whisper
Convert speech in audio to text
126M runs
jaaari/kokoro-82m
Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)
44.9M runs
vaibhavs10/incredibly-fast-whisper
whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗
15.8M runs
qwen/qwen-image-edit-plus
The latest iteration of Qwen-Image, with improved multi-image editing, single-image consistency, and native support for ControlNet
1.9K runs
openai/gpt-5-structured
GPT-5 with support for structured outputs, web search and custom tools
58.6K runs
google/nano-banana
Google's latest image editing model in Gemini 2.5
8.4M runs
pixverse/pixverse-v5
Create 5s-8s videos with enhanced character movement, visual effects, and exclusive 1080p-8s support. Optimized for anime characters and complex actions
20.4K runs
leonardoai/lucid-origin
Artistic and high-quality visuals with improved prompt adherence, diversity, and definition
18.8K runs
bytedance/seedream-4
Unified text-to-image generation and precise single-sentence editing at up to 4K resolution
836.8K runs
qwen/qwen-image
An image generation foundation model in the Qwen series that achieves significant advances in complex text rendering.
467.7K runs
kwaivgi/kling-v2.1
Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)
1.7M runs
minimax/hailuo-02
Hailuo 2 is a text-to-video and image-to-video model that can make 6s or 10s videos at 768p (standard) or 1080p (pro). It excels at real-world physics.
110K runs
deepseek-ai/deepseek-v3.1
Latest hybrid thinking model from DeepSeek
4.2K runs
wan-video/wan-2.2-t2v-fast
A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B text-to-video
72.4K runs
prunaai/wan-2.2-image
This model generates beautiful cinematic 2 megapixel images in 3-4 seconds and is derived from the Wan 2.2 model through optimisation techniques from the pruna package
360.5K runs
Official models are always on, maintained, and have predictable pricing.
The latest iteration of Qwen-Image, with improved multi-image editing, single-image consistency, and native support for ControlNet
GPT-5 with support for structured outputs, web search and custom tools
A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, with a focus on fewer inference steps
Generate realistic lipsync animations from audio for high-quality synchronization
Quickly make 5s or 8s videos at 540p, 720p or 1080p. It has enhanced motion, prompt coherence and handles complex actions well.
A multimodal image generation model that creates high-quality images. You need to bring your own verified OpenAI key to use this model. Your OpenAI account will be charged for usage.
Generate 5s and 10s videos in 720p resolution at 30fps
Generate 5s and 10s videos in 720p resolution
Bria Increase resolution upscales the resolution of any image. It increases resolution using a dedicated upscaling method that preserves the original image content without regeneration.
A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, thanks to Query-Key Normalization.
2.5 billion parameter image model with improved MMDiT-X architecture
A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution
OpenAI's new model excelling at coding, writing, and reasoning.
A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution
Fast, high-quality text-to-video and image-to-video (also known as Dream Machine)
Turns your audio/video/images into professional-quality animated videos
A new way to edit, transform and generate video
Automated background removal for images. Tuned for AI-generated content, product photos, portraits, and design workflows
Granite-3.3-8B-Instruct is an 8-billion-parameter language model with a 128K context length, fine-tuned for improved reasoning and instruction-following capabilities.
Google's latest image generation model in Gemini 2.5
Use AI To Generate Images & Photos with an API
Use AI To Caption Videos with an API
Convert text to speech
Make realistic images of people instantly
Use AI To Generate Videos with an API
Upscaling models that create high-quality images from low-quality images
Use AI To Generate Music with an API
Use AI To Edit Any Image with an API
Models that convert speech to text
Optical character recognition (OCR) and text extraction
Models that remove backgrounds from images and videos
The FLUX family of text-to-image models from Black Forest Labs
Models that improve or restore images by deblurring, colorization, and removing noise
Upscaling models that create high-quality video from low-quality videos
Use AI To Lipsync videos with an API
Browse the diverse range of fine-tunes the community has custom-trained on Replicate
Browse the diverse range of qwen-image fine-tunes the community has custom-trained on Replicate
Models that can understand and generate text
Toolbelt-type models for videos and images.
Use AI To Caption Images with an API
Use AI To Generate Videos from images with an API
Generate videos with Wan, the fastest and highest quality open-source video generation model.
Ask language models about images
Models that generate 3D objects, scenes, radiance fields, textures and multi-views.
Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.
Voice-to-voice cloning and musical prosody
Models that generate embeddings from inputs
Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.
Models that detect or segment objects in images and videos.
amrdiab568/charify-style
148 runs
qwen/qwen-image-edit-plus
The latest iteration of Qwen-Image, with improved multi-image editing, single-image consistency, and native support for ControlNet
2K runs
vyroteam/imagineart-1.0
A state-of-the-art Mixture of Experts (MoE) model for generating hyper-realistic images with unmatched detail, natural lighting, and photographic authenticity.
84 runs
hebhar/handsketch
Hand sketch style
22 runs
andyx1976/dress_shirt
Plain men's dress shirts, mainly made to stop Flux Krea from making all shirts look kind of scruffy and old: white, blue, checked, slim fit, with tie and without..
23 runs
stability-ai/stable-diffusion-3.5-large-turbo
A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, with a focus on fewer inference steps
795.5K runs
pixverse/lipsync
Generate realistic lipsync animations from audio for high-quality synchronization
27 runs
pixverse/pixverse-v4.5
Quickly make 5s or 8s videos at 540p, 720p or 1080p. It has enhanced motion, prompt coherence and handles complex actions well.
157.9K runs
drjempower360pro/drjsykes-replicate-2
14 runs
openai/gpt-image-1
A multimodal image generation model that creates high-quality images. You need to bring your own verified OpenAI key to use this model. Your OpenAI account will be charged for usage.
506.6K runs
kwaivgi/kling-v1.6-standard
Generate 5s and 10s videos in 720p resolution at 30fps
1.2M runs