

prunaai / p-image-edit
A sub 1 second 0.01$ multi-image editing model built for production use cases. For image generation, check out p-image here: https://replicate.com/prunaai/p-image
31.1M runs


851-labs / background-remover
Remove backgrounds from images.
23.6M runs


jaaari / kokoro-82m
Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)
92.5M runs


aisha-ai-official / animagine-xl-v4-opt
12.6M runs
Alibaba's Happy Horse 1.0 generates videos from text prompts or animates a single image into video. Supports 720p and 1080p, 3-15 second durations, and five aspect ratios.
10.6K runs

openai/gpt-image-2OpenAI's state-of-the-art image generation model. Create and edit images from text with strong instruction following, sharp text rendering, and detailed editing.
3.3M runs

Anthropic's most capable model with a step-change improvement in agentic coding, better vision, and stronger multi-step reasoning
45.8K runs

Google's fast, expressive text-to-speech model with 30 voices and 70+ language support
90.6K runs

Generate full-length songs or instrumentals from a text prompt, with optional auto-generated lyrics
6.7K runs

bytedance/seedance-2.0ByteDance's multimodal video generation model with native audio, multimodal reference inputs, and intelligent duration control.
358.1K runs

Google's cost-efficient video generation model with native audio, optimized for high-volume applications
31.9K runs
prunaai/p-video-avatarp-video-avatar is the fastest and cheapest avatar/lipsync video model on the market.
42K runs

bytedance/seedream-5-liteSeedream 5.0 lite: image generation with built-in reasoning, example-based editing, and deep domain knowledge
2.1M runs
Generate videos using xAI's Grok Imagine Video model
913.2K runs

The highest fidelity image model from Black Forest Labs
2.5M runs

Google's fast image generation model with conversational editing, multi-image fusion, and character consistency
9.7M runs
Official models are always on, maintained, and have predictable pricing.
Upscale and enhance video up to 4K at 60fps, with scene-aware presets for AI-generated content, short dramas, UGC, and film restoration.

Google's fast multimodal model with frontier reasoning across agents, coding, and long-context tasks

Create realistic talking avatar videos from text with HeyGen's Avatar V engine — the newest, highest-quality avatar engine with cross-reference-driven animation.

Granite Vision 4.1 4B is a vision-language model (VLM) that delivers frontier-level performance on structured document extraction tasks — chart extraction, table extraction, and semantic key-value pair extraction — in a compact 4B parameter footprint

A faster, lighter Recraft image generation model at ~2048px resolution, optimized for high-volume production. Design taste and prompt accuracy at high resolution with better throughput.

A faster, lighter Recraft image generation model optimized for high-volume and production pipelines. Same design taste as V4.1, built for speed and throughput.
Generate detailed SVG vector graphics from text prompts. Recraft V4.1 Pro's design taste with more geometric detail and finer paths — clean layers, editable output, and scalable to any size.
Generate production-ready SVG vector images from text prompts. Recraft V4.1's design taste applied to vector output — clean geometry, structured layers, and editable paths.

Recraft's latest image generation model at ~2048px resolution. Same design taste and prompt accuracy as V4.1, with higher resolution for print-ready and large-scale work.

Recraft's latest image generation model, built around design taste. Strong prompt accuracy, art-directed composition, and integrated text rendering. Fast and cost-efficient at standard resolution.

xAI's higher-quality image model with sharper details, better text rendering, and 2k output

Transcribe speech with ElevenLabs Scribe v2. 90+ languages, word-level timestamps, speaker diarization for up to 32 speakers, audio event tagging, and keyterm biasing. Files up to 3 GB and 10 hours.

Most expressive text-to-speech model from Inworld, with natural-language steering, real-time latency, and multilingual support across 100+ languages.

The first creative upscaler which keeps identity. Stunning photorealistic results, realistic skin, and full creative control.

Convert text to natural-sounding speech with xAI's Grok TTS. 5 voices, 20 languages, expressive speech tags, and high-fidelity MP3 / WAV / telephony audio output.

Transcribe audio to text with xAI's Grok. Handles 25 languages, word-level timestamps, speaker diarization, multichannel audio, and files up to 500 MB.

Granite Speech 4.1 2B is a compact and efficient speech-language model, specifically designed for multilingual automatic speech recognition (ASR) and bidirectional automatic speech translation (AST) for English, French, German, Spanish, Portuguese and Jap
Alibaba's Happy Horse 1.0 generates videos from text prompts or animates a single image into video. Supports 720p and 1080p, 3-15 second durations, and five aspect ratios.

Granite-embedding-small-english-r2 is a 47M parameter dense biencoder embedding model from the Granite Embeddings collection that can be used to generate high quality text embeddings.

Granite-4.1-8B is a 8B parameter long-context instruct model finetuned from Granite-4.1-8B-Base using a combination of open source instruction datasets with permissive license and internally collected synthetic datasets.
Use AI to generate images & photos with an API
Use AI to understand, describe, and caption videos with an API
Use AI for text-to-speech or to clone your voice via API
Use AI to generate images from a face with an API
Use AI to generate videos with an API
Use AI to upscale and enhance images with an API
Use AI to generate music with an API
Use AI to edit any image via API
Use AI to transcribe speech to text with an API
Use AI For Optical Character Recognition (OCR) to extract text from images via API
Use AI to remove backgrounds from images and videos with an API
FLUX AI models by Black Forest Labs: image generation & editing via API
Use AI to restore images via API
Use AI to upscale, restore, extend, and enhance videos with an API
Detect NSFW content in images and text
Classify text by sentiment, topic, intent, or safety
Identify speakers from audio and video inputs
Replace faces across images with natural-looking results.
Transform rough sketches into polished visuals
Generate custom emojis from text or images
Create anime-style characters, scenes, and animations
Use AI to generate videos from images with an API
Chat with images — visual Q&A, analysis, and reasoning via API
Use AI to generate captions and descriptions from images with an API
Use AI to edit, restyle, extend, and remix videos with an API
WAN family of models: open-source video, image, and audio generation
Generate 3D objects, meshes, and textures from text or images with an API
Official models are always on, predictably priced, and have a stable API.
Explore Large Language Models (LLMs) for chat, generation & NLP tasks via API
Try AI Models for free: video generation, image generation, upscaling, and photo restoration
Use AI to generate lipsync videos with an API
Use AI to control image generation with an API
Embedding models for AI search and analysis
Use AI object detection and segmentation models to distinguish objects in images & videos
Flux fine-tunes: build and run custom AI image models via API
Kontext fine-tunes: Build custom AI image models with an API
Create songs with voice cloning models via API
AI media utilities: auto-caption, watermark, frame extraction & more via API
Browse the diverse range of qwen-image fine-tunes the community has custom-trained on Replicate.


colinhughes2121 / ai-charcoal-portrait
Transform any photo into a moody charcoal drawing. Bold strokes, dramatic shading, fine art gallery quality. For portraits, memorial pieces, gifts.
1 run


colinhughes2121 / ai-pop-art-stylizer
Turn any photo into Andy Warhol-style pop art. Bold flat colors, halftone dots, 1960s aesthetic. For gifts, prints, fan art, party decor.
1 run


colinhughes2121 / ai-vintage-film-photo
Convert any digital photo into 35mm film photography aesthetic. Kodak Portra, Fuji Superia film stock looks, organic grain. For photographers, content creators.
1 run


colinhughes2121 / ai-anime-villain
Transform any photo into a menacing anime villain character. Dark dramatic lighting, smirking expression, manga antagonist style. For fan art, profile pics, party invites.
1 run


colinhughes2121 / ai-tarot-card-stylizer
Transform any photo into a mystical tarot card illustration. Rider-Waite aesthetic, gold leaf accents, esoteric symbolism. For readers, spiritualists, gifts, fan art.
1 run


colinhughes2121 / ai-fantasy-landscape
Transform any landscape into an epic fantasy art scene. Lord of the Rings / D&D aesthetic, dramatic skies, magical atmosphere. For tabletop gamers, fan art, wallpapers.
1 run


colinhughes2121 / ai-cyberpunk-city
Convert any cityscape into a neon-soaked Cyberpunk 2077 / Blade Runner scene. Rain-slicked streets, glowing signs, futuristic atmosphere. For wallpapers, banners, fan art.
1 run


colinhughes2121 / ai-greek-statue
Turn any photo into a classical Greek marble statue portrait. Sculpted features, ancient aesthetic, museum quality. Trending social aesthetic, gifts, fan art.
1 run


colinhughes2121 / ai-egyptian-pharaoh
Transform any photo into an ancient Egyptian pharaoh or queen portrait. Gold headdresses, hieroglyphics, mystical desert lighting. Halloween, gifts, fan art.
1 run


colinhughes2121 / ai-medieval-portrait
Turn any photo into a medieval European court portrait. Knights, lords, ladies, period-accurate clothing. For RPG players, history buffs, fan art, gifts.
1 run


colinhughes2121 / ai-nft-art-stylizer
Transform any photo into trending NFT collection art style. Punk-style, Apes-style, geometric, layered traits aesthetic. For collectors, traders, profile pics.
2 runs


colinhughes2121 / ai-retro-pixel-art
Transform any photo into 16-bit retro pixel art. NES/SNES/Sega Genesis aesthetic. For gamers, indie devs, profile pics, NFTs.
1 run