prunaai/flux.1-dev
This is the fastest Flux Dev endpoint in the world. Contact us for more at pruna.ai
20.4M runs
jaaari/kokoro-82m
Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)
42.8M runs
xinntao/gfpgan
Practical face restoration algorithm for *old photos* or *AI-generated faces*
40.1M runs
zsxkib/mmaudio
Add sound to video using the MMAudio V2 model. An advanced AI model that synthesizes high-quality audio from video content, enabling seamless video-to-audio transformation.
3M runs
pixverse/pixverse-v5
Create 5s-8s videos with enhanced character movement, visual effects, and exclusive 1080p-8s support. Optimized for anime characters and complex actions
12.9K runs
google/nano-banana
Google's latest image editing model in Gemini 2.5
4.9M runs
bytedance/seedream-4
Unified text-to-image generation and precise single-sentence editing at up to 4K resolution
195.6K runs
leonardoai/lucid-origin
Artistic and high-quality visuals with improved prompt adherence, diversity, and definition
2.9K runs
openai/gpt-5-structured
GPT-5 with support for structured outputs, web search and custom tools
36.3K runs
qwen/qwen-image
An image generation foundation model in the Qwen series that achieves significant advances in complex text rendering.
357.6K runs
kwaivgi/kling-v2.1
Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)
1.2M runs
minimax/hailuo-02
Hailuo 2 is a text-to-video and image-to-video model that can make 6s or 10s videos at 768p (standard) or 1080p (pro). It excels at real-world physics.
90.8K runs
deepseek-ai/deepseek-v3.1
Latest hybrid thinking model from DeepSeek
2.6K runs
qwen/qwen-image-edit
Edit images using a prompt. This model extends Qwen-Image’s unique text rendering capabilities to image editing tasks, enabling precise text editing
283.6K runs
wan-video/wan-2.2-t2v-fast
A very fast and cheap PrunaAI-optimized version of Wan 2.2 A14B text-to-video
66.6K runs
prunaai/wan-2.2-image
This model generates beautiful, cinematic 2-megapixel images in 3-4 seconds. It is derived from the Wan 2.2 model through optimisation techniques from the pruna package
290K runs
Official models are always on, maintained, and have predictable pricing.
Create 5s-8s videos with enhanced character movement, visual effects, and exclusive 1080p-8s support. Optimized for anime characters and complex actions
A new way to edit, transform and generate video
Google's latest image editing model in Gemini 2.5
Professional edge-guided image generation. Control structure and composition using Canny edge detection
Generate a video from an audio clip and a reference image
Generate 5s and 9s 720p videos, faster and cheaper than Ray 2
Music-1.5: Full-length songs (up to 4 mins) with natural vocals & rich instrumentation
Generate videos with specific camera movements
Generate 5s and 10s videos in 1080p resolution
Upscale videos by 4x, up to a maximum of 4k
Unified text-to-image generation and precise single-sentence editing at up to 4K resolution
Turns your audio/video/images into professional-quality animated videos
Bria AI's remove background model
Bria Expand expands images beyond their borders in high quality, resizing the image by generating new pixels to reach the desired aspect ratio. Trained exclusively on licensed data for safe and risk-free commercial use
An AI system that can create realistic images and art from a description in natural language.
Use this ultra version of Imagen 4 when quality matters more than speed and cost
A faster and cheaper version of Google’s Veo 3 video model, with audio
Generate high-quality music and sound from text prompts
Add consistent, customizable shadows to product cutouts for enhanced visual appeal
Transform any product photo into professional 2000x2000px packshots with optimal positioning
Use AI to generate images and photos with an API
Use AI to caption videos with an API
Convert text to speech
Make realistic images of people instantly
Use AI to generate videos with an API
Upscaling models that create high-quality images from low-quality images
Use AI to generate music with an API
Use AI to edit any image with an API
Models that convert speech to text
Optical character recognition (OCR) and text extraction
Models that remove backgrounds from images and videos
The FLUX family of text-to-image models from Black Forest Labs
Models that improve or restore images by deblurring, colorization, and removing noise
Upscaling models that create high-quality video from low-quality videos
Browse the diverse range of fine-tunes the community has custom-trained on Replicate
Browse the diverse range of qwen-image fine-tunes the community has custom-trained on Replicate
Models that can understand and generate text
Toolbelt-type models for videos and images.
Use AI to caption images with an API
Use AI to generate videos from images with an API
Generate videos with Wan, the fastest and highest-quality open-source video generation model.
Ask language models about images
Models that generate 3D objects, scenes, radiance fields, textures and multi-views.
Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.
Voice-to-voice cloning and musical prosody
Models that generate embeddings from inputs
Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.
Models that detect or segment objects in images and videos.
zsxkib/easyocr
Extract text with pixel coordinates from screenshots and images. GPU-accelerated, multi-language, perfect for camera-translation overlays.
63 runs
zylim0702/yun-3d-2.1
75 runs
zylim0702/qwen-multi-view-advanced
125 runs
lhungting-ship-it/card-glare-detection
Detect the glare region on the card
6 runs
hautechai/json-to-media
49 runs
adirik/interior-design-v2
Transform empty rooms into furnished and stylized rooms
65 runs
shreejalmaharjan-27/anime4k
Anime4K is a set of open-source, high-quality real-time anime upscaling/denoising algorithms that can be implemented in any programming language.
2 runs
meatballhat/turtle-head
Puts a turtle on the head of an historical dipshit.
62 runs
frankxai/arcanea
The world's best model to imagine the limitless world of Arcanea
35 runs
tencent/hunyuan-image-2.1
Generate high-quality 2K resolution images from text prompts
146 runs
deyoyk/asian
Fund me so I can train with even more images of Asian girls :3 https://buymeacoffee.com/deyoyk
46 runs
superhighfives/song-exploder
From a song idea to stems
23 runs