beautyyuyanli/multilingual-e5-large
multilingual-e5-large: A multi-language text embedding model
28.6M runs
jaaari/kokoro-82m
Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)
50.2M runs
openai/whisper
Convert speech in audio to text
133.7M runs
andreasjansson/clip-features
Return CLIP features for the clip-vit-large-patch14 model
112.3M runs
openai/sora-2
OpenAI's Flagship video generation with synced audio
15.2K runs
google/nano-banana
Google's latest image editing model in Gemini 2.5
19M runs
tencent/hunyuan-image-3
A powerful native multimodal model for image generation (PrunaAI squeezed)
4.6K runs
ibm-granite/granite-4.0-h-small
Granite-4.0-H-Small is a 32B parameter long-context instruct model finetuned from Granite-4.0-H-Small-Base using a combination of open source instruction datasets with permissive license and internally collected synthetic datasets.
2.6K runs
character-ai/ovi-i2v
Ovi: generate videos with audio from image and text inputs
2.6K runs
leonardoai/lucid-origin
Artistic and high-quality visuals with improved prompt adherence, diversity, and definition
66.8K runs
bytedance/seedream-4
Unified text-to-image generation and precise single-sentence editing at up to 4K resolution
2.9M runs
anthropic/claude-4.5-sonnet
Claude Sonnet 4.5 is the best coding model to date, with significant improvements across the entire development lifecycle
7K runs
wan-video/wan-2.5-t2v
Alibaba Wan 2.5 text to video generation model
7.8K runs
pixverse/pixverse-v5
Create 5s-8s videos with enhanced character movement, visual effects, and exclusive 1080p-8s support. Optimized for anime characters and complex actions
173.7K runs
qwen/qwen-image-edit-plus
The latest Qwen-Image’s iteration with improved multi-image editing, single-image consistency, and native support for ControlNet
1M runs
openai/gpt-5
OpenAI's new model excelling at coding, writing, and reasoning.
347K runs
Official models are always on, maintained, and have predictable pricing.
Generate vivid, realistic images based on a text prompt. Excels at generating images for marketing, social media, and entertainment.
OpenAI's Most advanced synced-audio video generation
OpenAI's Flagship video generation with synced audio
AI-powered virtual try-on - see how clothes look on models instantly without physical photoshoots
120b open-weight language model from OpenAI
20b open-weight language model from OpenAI
A reasoning model trained with reinforcement learning, on par with OpenAI o1
Meta's flagship 405 billion parameter language model, fine-tuned for chat completions
A speech-to-text model that uses GPT-4o mini to transcribe audio
A speech-to-text model that uses GPT-4o to transcribe audio
Generate 5s and 9s 540p videos, faster and cheaper than Ray 2
Bring your subjects into focus with FLUX.1 Kontext [pro]
Add simple filters to your images
Put yourself in an iconic location around the world from a single image
Use flux-kontext-pro to change the first or last frame of a video. Useful to use as inputs for restyling an entire video in a certain way
Remove all text from an image with FLUX.1 Kontext
Use FLUX Kontext to restore, fix scratches and damage, and colorize old photos
Create a series of portrait photos from a single image
Become a character, in style
Turn your image into a cartoon with FLUX.1 Kontext [pro]
Use AI To Generate Images & Photos with an API
Use AI To Caption Videos with an API
Convert text to speech
Make realistic images of people instantly
Use AI To Generate Videos with an API
Upscaling models that create high-quality images from low-quality images
Use AI To Generate Music with an API
Use AI To Edit Any Image with an API
Models that convert speech to text
Optical character recognition (OCR) and text extraction
Models that remove backgrounds from images and videos
The FLUX family of text-to-image models from Black Forest Labs
Models that improve or restore images by deblurring, colorization, and removing noise
Upscaling models that create high-quality video from low-quality videos
Use AI To Generate Videos from images with an API
Use AI To Lipsync videos with an API
Browse the diverse range of fine-tunes the community has custom-trained on Replicate
Browse the diverse range of qwen-image fine-tunes the community has custom-trained on Replicate
Models that can understand and generate text
Toolbelt-type models for videos and images.
Use AI To Caption Images with an API
Generate videos with Wan, the fastest and highest quality open-source video generation model.
Ask language models about images
Models that generate 3D objects, scenes, radiance fields, textures and multi-views.
Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.
Voice-to-voice cloning and musical prosody
Models that generate embeddings from inputs
Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.
Official models are always on, maintained, and have predictable pricing.
Models that detect or segment objects in images and videos.
Browse the diverse range of fine-tunes the community has custom-trained on Replicate
vufinder/vggt-1b-depth
Feed-forward neural network that directly infers all key 3D attributes of a scene.
50 runs
vufinder/vggt-1b-point
Feed-forward neural network that directly infers all key 3D attributes of a scene.
70 runs
vufinder/vggt-1b
Feed-forward neural network that directly infers all key 3D attributes of a scene.
38 runs
espressotechie/qwen-imgedit-4bit
Qwen image edit fast
79 runs
the-making-company/modezty-influenser-2
2 runs
fredybalibuno/manhwa-style-lora
Generates detailed, story-driven manhwa scenes with rich lighting, expressive characters, and dynamic poses.
27 runs
xai/grok-2-image
Generate vivid, realistic images based on a text prompt. Excels at generating images for marketing, social media, and entertainment.
238 runs
marcobitboss274/marcocandelabro
Monochromatic, high-resolution Rococo-Baroque engraving with dense, vertical organic geometry defined by sinuous calligraphic lines, golden ratio swirls, and volume from fine cross-hatching.
28 runs
lucataco/featured-vid
Convert videos down to a web friendly size while maintaining video quality
8 runs
oliland/cog-diffusionlight-turbo
cineball from an image
73 runs
richards630620/tanyazzz
23 runs
lucataco/featured-img
Convert images down to a web friendly size while maintaining image quality
36 runs