Use official models
Official models are always on, actively maintained, and predictably priced.
Recommended models

openai / gpt-5-structured
GPT-5 with support for structured outputs, web search and custom tools
Updated 23 hours ago

bytedance / seedance-1-pro
A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution
Updated 23 hours ago

openai / gpt-5
OpenAI's new model excelling at coding, writing, and reasoning.
Updated 23 hours ago

bytedance / seedance-1-lite
A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution
Updated 23 hours ago

pixverse / pixverse-v4.5
Quickly make 5s or 8s videos at 540p, 720p or 1080p. It has enhanced motion, prompt coherence and handles complex actions well.
Updated 1 day, 1 hour ago

luma / ray
Fast, high-quality text-to-video and image-to-video (also known as Dream Machine)
Updated 1 day, 1 hour ago

bytedance / omni-human
Turns your audio/video/images into professional-quality animated videos
Updated 1 day, 3 hours ago

runwayml / gen4-aleph
A new way to edit, transform and generate video
Updated 1 day, 3 hours ago

recraft-ai / recraft-remove-background
Automated background removal for images. Tuned for AI-generated content, product photos, portraits, and design workflows
Updated 1 day, 3 hours ago

ibm-granite / granite-3.3-8b-instruct
Granite-3.3-8B-Instruct is an 8-billion-parameter language model with a 128K context length, fine-tuned for improved reasoning and instruction-following capabilities.
Updated 1 day, 3 hours ago

google / gemini-2.5-flash-image
Google's latest image generation model in Gemini 2.5
Updated 1 day, 3 hours ago

google / nano-banana
Google's latest image editing model in Gemini 2.5
Updated 1 day, 4 hours ago

luma / modify-video
Modify a video with style transfer and prompt-based editing
Updated 2 days, 1 hour ago

kwaivgi / kling-lip-sync
Add lip-sync to any video with an audio file or text
Updated 2 days, 6 hours ago

pixverse / pixverse-v5
Create 5s-8s videos with enhanced character movement, visual effects, and exclusive 1080p-8s support. Optimized for anime characters and complex actions
Updated 2 days, 7 hours ago

mirelo / video-to-sfx
Generate synced sounds for any video and return the video with its new soundtrack
Updated 2 days, 19 hours ago

google / imagen-4-fast
Use this fast version of Imagen 4 when speed and cost are more important than quality
Updated 2 days, 22 hours ago

openai / gpt-5-nano
Fastest, most cost-effective GPT-5 model from OpenAI
Updated 3 days, 3 hours ago

openai / gpt-5-mini
Faster version of OpenAI's flagship GPT-5 model
Updated 3 days, 4 hours ago

luma / photon-flash
Accelerated variant of Photon prioritizing speed while maintaining quality
Updated 3 days, 5 hours ago

intelligent-utilities / html-to-image
Updated 3 days, 5 hours ago

flux-kontext-apps / portrait-series
Create a series of portrait photos from a single image
Updated 3 days, 5 hours ago

openai / gpt-image-1
A multimodal image generation model that creates high-quality images. You need to bring your own verified OpenAI key to use this model. Your OpenAI account will be charged for usage.
Updated 3 days, 5 hours ago

heygen / video-translate
Translate videos into over 150 languages
Updated 3 days, 23 hours ago

runwayml / gen4-turbo
Generate 5s and 10s 720p videos fast
Updated 4 days, 2 hours ago

openai / gpt-4.1
OpenAI's flagship GPT model for complex tasks.
Updated 4 days, 3 hours ago

openai / gpt-4.1-nano
Fastest, most cost-effective GPT-4.1 model from OpenAI
Updated 4 days, 3 hours ago

openai / gpt-4.1-mini
Fast, affordable version of GPT-4.1
Updated 4 days, 3 hours ago

black-forest-labs / flux-depth-pro
Professional depth-aware image generation. Edit images while preserving spatial relationships.
Updated 4 days, 3 hours ago

minimax / image-01
Minimax's first image model, with character reference support
Updated 4 days, 5 hours ago

bria / expand-image
Bria Expand extends images beyond their borders in high quality, resizing the image by generating new pixels to reach the desired aspect ratio. Trained exclusively on licensed data for safe and risk-free commercial use
Updated 4 days, 22 hours ago

flux-kontext-apps / multi-image-list
FLUX Kontext max with list input for multiple images
Updated 4 days, 23 hours ago

flux-kontext-apps / face-to-many-kontext
Become a character, in style
Updated 4 days, 23 hours ago

sync / lipsync-2-pro
Studio-grade lipsync in minutes, not weeks
Updated 4 days, 23 hours ago

sync / lipsync-2
Generate realistic lipsyncs with Sync Labs' 2.0 model
Updated 4 days, 23 hours ago

pixverse / pixverse-v4
Quickly generate smooth 5s or 8s videos at 540p, 720p or 1080p
Updated 5 days, 1 hour ago

leonardoai / lucid-origin
Artistic and high-quality visuals with improved prompt adherence, diversity, and definition
Updated 5 days, 1 hour ago

google / veo-3
Sound on: Google’s flagship Veo 3 text-to-video model, with audio
Updated 5 days, 3 hours ago

luma / reframe-image
Change the aspect ratio of any photo using AI (not cropping)
Updated 5 days, 4 hours ago

tencent / hunyuan-image-2.1
Generate high-quality 2K resolution images from text prompts
Updated 1 week ago

black-forest-labs / flux-canny-pro
Professional edge-guided image generation. Control structure and composition using Canny edge detection
Updated 1 week, 1 day ago

wan-video / wan-2.2-s2v
Generate a video from an audio clip and a reference image
Updated 1 week, 1 day ago

luma / ray-flash-2-720p
Generate 5s and 9s 720p videos, faster and cheaper than Ray 2
Updated 1 week, 1 day ago

minimax / music-1.5
Music-1.5: Full-length songs (up to 4 mins) with natural vocals & rich instrumentation
Updated 1 week, 1 day ago

minimax / video-01-director
Generate videos with specific camera movements
Updated 1 week, 1 day ago

kwaivgi / kling-v1.6-pro
Generate 5s and 10s videos in 1080p resolution
Updated 1 week, 1 day ago

runwayml / upscale-v1
Upscale videos by 4x, up to a maximum of 4K
Updated 1 week, 1 day ago

bytedance / seedream-4
Unified text-to-image generation and precise single-sentence editing at up to 4K resolution
Updated 1 week, 1 day ago

bria / remove-background
Bria AI's remove background model
Updated 1 week, 1 day ago

openai / dall-e-3
An AI system that can create realistic images and art from a description in natural language.
Updated 1 week, 2 days ago

google / imagen-4-ultra
Use this ultra version of Imagen 4 when quality matters more than speed and cost
Updated 1 week, 2 days ago

google / veo-3-fast
A faster and cheaper version of Google’s Veo 3 video model, with audio
Updated 1 week, 2 days ago

stability-ai / stable-audio-2.5
Generate high-quality music and sound from text prompts
Updated 1 week, 2 days ago

bria / product-shadow
Add consistent, customizable shadows to product cutouts for enhanced visual appeal
Updated 1 week, 2 days ago

bria / product-packshot
Transform any product photo into professional 2000x2000px packshots with optimal positioning
Updated 1 week, 2 days ago

bria / product-cutout
Precise AI-powered product cutout with 256-level transparency for eCommerce
Updated 1 week, 2 days ago

openai / gpt-4o
OpenAI's high-intelligence chat model
Updated 1 week, 2 days ago

minimax / music-01
Quickly generate up to 1 minute of music with lyrics and vocals in the style of a reference track
Updated 1 week, 2 days ago

ideogram-ai / ideogram-v3-quality
The highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles
Updated 1 week, 3 days ago

ideogram-ai / ideogram-v3-turbo
Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles
Updated 1 week, 3 days ago

ideogram-ai / ideogram-v3-balanced
Balance speed, quality and cost. Ideogram v3 creates images with stunning realism, creative designs, and consistent styles
Updated 1 week, 3 days ago

bytedance / seededit-3.0
Text-guided image editing model that preserves original details while making targeted modifications like lighting changes, object removal, and style conversion
Updated 1 week, 3 days ago

bytedance / seedream-3
A text-to-image model with support for native high-resolution (2K) image generation
Updated 1 week, 3 days ago

google / lyria-2
Lyria 2 is a music generation model that produces 48kHz stereo audio through text-based prompts
Updated 1 week, 3 days ago

black-forest-labs / flux-kontext-pro
A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language
Updated 1 week, 5 days ago

qwen / qwen-image
An image generation foundation model in the Qwen series that achieves significant advances in complex text rendering.
Updated 2 weeks, 1 day ago

kwaivgi / kling-v2.1-master
A premium version of Kling v2.1 with superb dynamics and prompt adherence. Generate 1080p 5s and 10s videos from text or an image
Updated 2 weeks, 3 days ago

kwaivgi / kling-v2.1
Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)
Updated 2 weeks, 3 days ago

kwaivgi / kling-v1.5-pro
Generate 5s and 10s videos in 1080p resolution at 30fps
Updated 2 weeks, 3 days ago

kwaivgi / kling-v1.5-standard
Generate 5s and 10s videos in 720p resolution at 30fps
Updated 2 weeks, 3 days ago

kwaivgi / kling-v1.6-standard
Generate 5s and 10s videos in 720p resolution at 30fps
Updated 2 weeks, 3 days ago

kwaivgi / kling-v2.0
Generate 5s and 10s videos in 720p resolution
Updated 2 weeks, 3 days ago

minimax / hailuo-02
Hailuo 2 is a text-to-video and image-to-video model that can make 6s or 10s videos at 768p (standard) or 1080p (pro). It excels at real world physics.
Updated 2 weeks, 6 days ago

deepseek-ai / deepseek-v3.1
Latest hybrid thinking model from DeepSeek
Updated 3 weeks, 2 days ago

recraft-ai / recraft-vectorize
Convert raster images to high-quality SVG format with precision and clean vector paths, perfect for logos, icons, and scalable graphics.
Updated 3 weeks, 3 days ago

bytedance / dreamina-3.1
4MP text-to-image generation with enhanced cinematic quality, precise style control, improved text rendering, and commercial design optimization.
Updated 1 month ago

qwen / qwen-image-lora-trainer
Fine-tunable Qwen Image model with exceptional composition abilities - train custom LoRAs for any style or subject
Updated 1 month ago

qwen / qwen-image-edit
Edit images using a prompt. This model extends Qwen-Image’s unique text rendering capabilities to image editing tasks, enabling precise text editing
Updated 1 month ago

fofr / color-matcher
Color match and white balance fixes for images
Updated 1 month, 1 week ago

openai / o4-mini
OpenAI's fast, lightweight reasoning model
Updated 1 month, 1 week ago

openai / o1-mini
A small model alternative to o1
Updated 1 month, 1 week ago

openai / o1
OpenAI's first o-series reasoning model
Updated 1 month, 1 week ago

openai / gpt-4o-mini
Low latency, low cost version of OpenAI's GPT-4o model
Updated 1 month, 1 week ago

runwayml / gen4-image-turbo
Gen-4 Image Turbo is cheaper and 2.5x faster than Gen-4 Image. An image model with references: use up to 3 reference images to create the exact image you need. Capture every angle.
Updated 1 month, 1 week ago

wan-video / wan-2.2-i2v-a14b
Image-to-video at 720p and 480p with Wan 2.2 A14B
Updated 1 month, 1 week ago

wan-video / wan-2.2-i2v-fast
A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B image-to-video
Updated 1 month, 1 week ago

wan-video / wan-2.2-t2v-fast
A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B text-to-video
Updated 1 month, 1 week ago

wan-video / wan-2.2-5b-fast
The fastest Wan 2.2 text-to-image and image-to-video model
Updated 1 month, 1 week ago

ideogram-ai / ideogram-character
Generate consistent characters from a single reference image. Outputs can be in many styles. You can also use inpainting to add your character to an existing image.
Updated 1 month, 1 week ago

qwen / qwen3-235b-a22b-instruct-2507
Updated Qwen3 model for instruction following
Updated 1 month, 2 weeks ago

ideogram-ai / ideogram-v2a-turbo
Like Ideogram v2 turbo, but now faster and cheaper
Updated 1 month, 2 weeks ago

ideogram-ai / ideogram-v2a
Like Ideogram v2, but faster and cheaper
Updated 1 month, 2 weeks ago

ideogram-ai / ideogram-v2-turbo
A fast image model with state of the art inpainting, prompt comprehension and text rendering.
Updated 1 month, 2 weeks ago

ideogram-ai / ideogram-v2
An excellent image model with state of the art inpainting, prompt comprehension and text rendering
Updated 1 month, 2 weeks ago

moonshotai / kimi-k2-instruct
Kimi K2 achieves exceptional performance across frontier knowledge, reasoning, and coding tasks while being meticulously optimized for agentic capabilities
Updated 1 month, 2 weeks ago

openai / gpt-oss-20b
20b open-weight language model from OpenAI
Updated 1 month, 2 weeks ago

openai / gpt-oss-120b
120b open-weight language model from OpenAI
Updated 1 month, 2 weeks ago

wavespeedai / qwen-image
A 20B MMDiT model for next-gen text-to-image generation
Updated 1 month, 2 weeks ago

wavespeedai / wan-2.1-i2v-720p
Accelerated inference for Wan 2.1 14B image-to-video at high resolution. Wan 2.1 is a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 1 month, 2 weeks ago

wavespeedai / wan-2.1-i2v-480p
Accelerated inference for Wan 2.1 14B image-to-video. Wan 2.1 is a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 1 month, 2 weeks ago

wavespeedai / wan-2.1-t2v-720p
Accelerated inference for Wan 2.1 14B text-to-video at high resolution. Wan 2.1 is a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 1 month, 2 weeks ago

wavespeedai / wan-2.1-t2v-480p
Accelerated inference for Wan 2.1 14B text-to-video. Wan 2.1 is a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 1 month, 2 weeks ago

minimax / hailuo-02-fast
A low cost and fast version of Hailuo 02. Generate 6s and 10s videos in 512p
Updated 1 month, 2 weeks ago

openai / clip
Official CLIP models. Generate CLIP (clip-vit-large-patch14) text and image embeddings
Updated 1 month, 2 weeks ago

ibm-granite / granite-speech-3.3-8b
Granite-speech-3.3-8b is a compact and efficient speech-language model, specifically designed for automatic speech recognition (ASR) and automatic speech translation (AST).
Updated 1 month, 2 weeks ago

minimax / video-01
Generate 6s videos with prompts or images (also known as Hailuo). Use a subject reference to make a video with a character and the S2V-01 model.
Updated 1 month, 3 weeks ago

ibm-granite / granite-vision-3.3-2b
Granite-vision-3.3-2b is a compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.
Updated 1 month, 3 weeks ago

luma / ray-2-540p
Generate 5s and 9s 540p videos
Updated 1 month, 3 weeks ago

luma / ray-2-720p
Generate 5s and 9s 720p videos
Updated 1 month, 3 weeks ago

luma / reframe-video
Change the aspect ratio of any video up to 30 seconds long. Outputs are 720p
Updated 1 month, 3 weeks ago

luma / ray-flash-2-540p
Generate 5s and 9s 540p videos, faster and cheaper than Ray 2
Updated 1 month, 3 weeks ago

flux-kontext-apps / cartoonify
Turn your image into a cartoon with FLUX.1 Kontext [pro]
Updated 1 month, 3 weeks ago

flux-kontext-apps / multi-image-kontext-pro
An experimental model with FLUX Kontext Pro that can combine two input images
Updated 1 month, 3 weeks ago

flux-kontext-apps / restyle-video-frame
Use flux-kontext-pro to change the first or last frame of a video. The result is useful as input for restyling an entire video in a certain way
Updated 1 month, 3 weeks ago

flux-kontext-apps / impossible-scenarios
Experience impossible adventures and extreme scenarios from a single image
Updated 1 month, 3 weeks ago

flux-kontext-apps / text-removal
Remove all text from an image with FLUX.1 Kontext
Updated 1 month, 3 weeks ago

flux-kontext-apps / multi-image-kontext-max
An experimental FLUX Kontext model that can combine two input images
Updated 1 month, 3 weeks ago

flux-kontext-apps / iconic-locations
Put yourself in an iconic location around the world from a single image
Updated 1 month, 3 weeks ago

flux-kontext-apps / restore-image
Use FLUX Kontext to restore, fix scratches and damage, and colorize old photos
Updated 1 month, 3 weeks ago

flux-kontext-apps / professional-headshot
Create a professional headshot photo from any single image
Updated 1 month, 3 weeks ago

black-forest-labs / flux-kontext-max
A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts
Updated 1 month, 3 weeks ago

flux-kontext-apps / change-haircut
Quickly change someone's hair style and hair color, powered by FLUX.1 Kontext [pro]
Updated 1 month, 3 weeks ago

black-forest-labs / flux-kontext-dev-lora
FLUX.1 Kontext [dev] image editing model for running LoRA fine-tunes
Updated 1 month, 4 weeks ago

runwayml / gen4-image
Runway's Gen-4 Image model with references. Use up to 3 reference images to create the exact image you need. Capture every angle.
Updated 1 month, 4 weeks ago

google / imagen-4
Google's Imagen 4 flagship model
Updated 2 months ago

google / imagen-3-fast
A faster and cheaper Imagen 3 model, for when price or speed are more important than final image quality
Updated 2 months ago

google / imagen-3
Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty
Updated 2 months ago

prunaai / wan-2.2-image
This model generates beautiful cinematic 2 megapixel images in 3-4 seconds and is derived from the Wan 2.2 model through optimisation techniques from the pruna package
Updated 2 months ago

bria / image-3.2
Commercial-ready text-to-image model trained entirely on licensed data. With only 4B parameters, it provides exceptional aesthetics and text rendering, and is evaluated to be on par with other leading models on the market
Updated 2 months ago

bria / generate-background
Bria Background Generation allows for efficient swapping of backgrounds in images via text prompts or a reference image, delivering realistic and polished results. Trained exclusively on licensed data for safe and risk-free commercial use
Updated 2 months, 1 week ago

bria / genfill
Bria GenFill enables high-quality object addition or visual transformation. Trained exclusively on licensed data for safe and risk-free commercial use.
Updated 2 months, 1 week ago

pixverse / pixverse-v3.5
Create videos in as little as 10 seconds. 5s or 8s videos at 360p, 540p, 720p or 1080p.
Updated 2 months, 1 week ago

bria / eraser
SOTA object removal. Enables precise removal of unwanted objects from images while maintaining high-quality outputs. Trained exclusively on licensed data for safe and risk-free commercial use
Updated 2 months, 2 weeks ago

bria / increase-resolution
Bria Increase resolution upscales the resolution of any image. It increases resolution using a dedicated upscaling method that preserves the original image content without regeneration.
Updated 2 months, 2 weeks ago

black-forest-labs / flux-kontext-dev
Open-weight version of FLUX.1 Kontext
Updated 2 months, 3 weeks ago

black-forest-labs / flux-dev-lora
A version of flux-dev, a text to image model, that supports fast fine-tuned lora inference
Updated 2 months, 3 weeks ago

black-forest-labs / flux-dev
A 12 billion parameter rectified flow transformer capable of generating images from text descriptions
Updated 2 months, 3 weeks ago

black-forest-labs / flux-schnell
The fastest image generation model tailored for local development and personal use
Updated 2 months, 3 weeks ago

black-forest-labs / flux-schnell-lora
The fastest image generation model tailored for fine-tuned use
Updated 2 months, 3 weeks ago

meta / llama-guard-4-12b
Updated 2 months, 3 weeks ago

resemble-ai / chatterbox
Generate expressive, natural speech. Features unique emotion control, instant voice cloning from short audio, and built-in watermarking.
Updated 3 months ago

minimax / video-01-live
An image-to-video (I2V) model specifically trained for Live2D and general animation use cases
Updated 3 months ago

resemble-ai / chatterbox-pro
Generate expressive, natural speech with Resemble AI's Chatterbox.
Updated 3 months ago

anthropic / claude-4-sonnet
Claude Sonnet 4 is a significant upgrade to 3.7, delivering superior coding and reasoning while responding more precisely to your instructions
Updated 3 months ago

flux-kontext-apps / renaissance
Turn yourself into a renaissance-era painting for those renaissance moments
Updated 3 months, 2 weeks ago

black-forest-labs / flux-1.1-pro-ultra-finetuned
Inference model for FLUX 1.1 [pro] Ultra using custom `finetune_id`. Supports 4MP images and raw mode for realism
Updated 3 months, 2 weeks ago

black-forest-labs / flux-pro-finetuned
Inference model for FLUX.1 [pro] using custom `finetune_id`
Updated 3 months, 2 weeks ago

flux-kontext-apps / filters
Add simple filters to your images
Updated 3 months, 2 weeks ago

flux-kontext-apps / depth-of-field
Bring your subjects into focus with FLUX.1 Kontext [pro]
Updated 3 months, 3 weeks ago

leonardoai / phoenix-1.0
Leonardo AI’s first foundational model produces images up to 5 megapixels (fast, quality and ultra modes)
Updated 3 months, 3 weeks ago

leonardoai / motion-2.0
Create 5s 480p videos from a text prompt
Updated 3 months, 3 weeks ago

google / veo-2
State of the art video generation model. Veo 2 can faithfully follow simple and complex instructions, and convincingly simulates real-world physics as well as a wide range of visual styles.
Updated 3 months, 4 weeks ago

openai / gpt-4o-transcribe
A speech-to-text model that uses GPT-4o to transcribe audio
Updated 4 months ago

openai / gpt-4o-mini-transcribe
A speech-to-text model that uses GPT-4o mini to transcribe audio
Updated 4 months ago

openai / dall-e-2
The original classic DALL·E 2
Updated 4 months ago

minimax / speech-02-turbo
Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Designed for real-time applications with low latency
Updated 4 months, 2 weeks ago

minimax / speech-02-hd
Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.
Updated 4 months, 2 weeks ago

minimax / voice-cloning
Clone voices to use with Minimax's speech-02-hd and speech-02-turbo
Updated 4 months, 2 weeks ago

easel / ai-avatars
Use one or two face images to create AI avatars
Updated 4 months, 3 weeks ago

black-forest-labs / flux-pro-trainer
Train FLUX.1 [pro] and FLUX 1.1 [pro] Ultra. Upload images to create a custom finetune_id to use with the inference model
Updated 4 months, 3 weeks ago

topazlabs / image-upscale
Professional-grade image upscaling, from Topaz Labs
Updated 4 months, 3 weeks ago

topazlabs / video-upscale
Video Upscaling from Topaz Labs
Updated 4 months, 3 weeks ago

meta / llama-4-maverick-instruct
A 17 billion parameter model with 128 experts
Updated 5 months, 2 weeks ago

meta / llama-4-scout-instruct
A 17 billion parameter model with 16 experts
Updated 5 months, 2 weeks ago

black-forest-labs / flux-fill-dev
Open-weight inpainting model for editing and extending images. Guidance-distilled from FLUX.1 Fill [pro].
Updated 5 months, 2 weeks ago

black-forest-labs / flux-1.1-pro-ultra
FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.
Updated 5 months, 2 weeks ago

black-forest-labs / flux-1.1-pro
Faster, better FLUX Pro. Text-to-image model with excellent image quality, prompt adherence, and output diversity.
Updated 5 months, 2 weeks ago

black-forest-labs / flux-pro
State-of-the-art image generation with top of the line prompt following, visual quality, image detail and output diversity.
Updated 5 months, 2 weeks ago

black-forest-labs / flux-fill-pro
Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, seamless results.
Updated 5 months, 2 weeks ago

deepseek-ai / deepseek-v3
DeepSeek-V3-0324 is the leading non-reasoning model, a milestone for open source
Updated 5 months, 3 weeks ago

recraft-ai / recraft-v3
Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis
Updated 5 months, 3 weeks ago

recraft-ai / recraft-v3-svg
Recraft V3 SVG (code-named red_panda) is a text-to-image model with the ability to generate high quality SVG images including logotypes, and icons. The model supports a wide list of styles.
Updated 5 months, 3 weeks ago

recraft-ai / recraft-20b-svg
Affordable and fast vector images
Updated 5 months, 3 weeks ago

recraft-ai / recraft-20b
Affordable and fast images
Updated 5 months, 3 weeks ago

black-forest-labs / flux-redux-schnell
Fast, efficient image variation model for rapid iteration and experimentation.
Updated 6 months ago

black-forest-labs / flux-redux-dev
Open-weight image variation model. Create new versions while preserving key elements of your original.
Updated 6 months ago

black-forest-labs / flux-depth-dev
Open-weight depth-aware image generation. Edit images while preserving spatial relationships.
Updated 6 months ago

black-forest-labs / flux-canny-dev
Open-weight edge-guided image generation. Control structure and composition using Canny edge detection.
Updated 6 months ago

easel / advanced-face-swap
Face swap one or two people into a target image
Updated 6 months, 2 weeks ago

ibm-granite / granite-3.2-8b-instruct
Granite-3.2-8B-Instruct is an 8-billion-parameter language model with a 128K context length, fine-tuned for reasoning and instruction-following capabilities.
Updated 6 months, 2 weeks ago

wan-video / wan-2.1-1.3b
Generate 5s 480p videos. Wan is an advanced and powerful visual generation model developed by Tongyi Lab of Alibaba Group
Updated 6 months, 3 weeks ago

anthropic / claude-3.7-sonnet
The most intelligent Claude model and the first hybrid reasoning model on the market (claude-3-7-sonnet-20250219)
Updated 6 months, 3 weeks ago

anthropic / claude-3.5-haiku
Anthropic's fastest, most cost-effective model, with a 200K token context window (claude-3-5-haiku-20241022)
Updated 7 months, 1 week ago

anthropic / claude-3.5-sonnet
Anthropic's most intelligent language model to date, with a 200K token context window and image understanding (claude-3-5-sonnet-20241022)
Updated 7 months, 1 week ago

google / upscaler
Upscale images 2x or 4x
Updated 7 months, 1 week ago

deepseek-ai / deepseek-r1
A reasoning model trained with reinforcement learning, on par with OpenAI o1
Updated 7 months, 3 weeks ago

recraft-ai / recraft-creative-upscale
Creative Upscale focuses on enhancing details and refining complex elements in the image. It doesn’t just increase resolution but adds depth by improving textures, fine details, and facial features.
Updated 8 months ago

recraft-ai / recraft-crisp-upscale
Designed to make images sharper and cleaner, Crisp Upscale increases overall quality, making visuals suitable for web use or print-ready materials.
Updated 8 months ago

playht / play-dialog
End-to-end AI speech model designed for natural-sounding conversational speech synthesis, with support for context-aware prosody, intonation, and emotional expression.
Updated 8 months, 1 week ago

ibm-granite / granite-3.1-8b-instruct
Granite-3.1-8B-Instruct is a lightweight and open-source 8B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.
Updated 9 months ago

ibm-granite / granite-3.1-2b-instruct
Granite-3.1-2B-Instruct is a lightweight and open-source 2B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.
Updated 9 months ago

luma / photon
High-quality image generation model optimized for creative professional workflows and ultra-high fidelity outputs
Updated 9 months, 2 weeks ago

stability-ai / stable-diffusion-3.5-medium
2.5 billion parameter image model with improved MMDiT-X architecture
Updated 10 months, 3 weeks ago

stability-ai / stable-diffusion-3.5-large-turbo
A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, with a focus on fewer inference steps
Updated 10 months, 4 weeks ago

stability-ai / stable-diffusion-3.5-large
A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, thanks to Query-Key Normalization.
Updated 10 months, 4 weeks ago

ibm-granite / granite-3.0-8b-instruct
Granite-3.0-8B-Instruct is a lightweight and open-source 8B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.
Updated 11 months ago

ibm-granite / granite-3.0-2b-instruct
Granite-3.0-2B-Instruct is a lightweight and open-source 2B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.
Updated 11 months ago

ibm-granite / granite-8b-code-instruct-128k
Join the Granite community where you can find numerous recipe workbooks to help you get started with a wide variety of use cases using this model. https://github.com/ibm-granite-community
Updated 1 year ago

ibm-granite / granite-20b-code-instruct-8k
Join the Granite community where you can find numerous recipe workbooks to help you get started with a wide variety of use cases using this model. https://github.com/ibm-granite-community
Updated 1 year ago

meta / meta-llama-3.1-405b-instruct
Meta's flagship 405 billion parameter language model, fine-tuned for chat completions
Updated 1 year, 1 month ago

stability-ai / stable-diffusion-3
A text-to-image model with greatly improved performance in image quality, typography, complex prompt understanding, and resource-efficiency
Updated 1 year, 2 months ago

meta / meta-llama-3-70b
Base version of Llama 3, a 70 billion parameter language model from Meta.
Updated 1 year, 5 months ago

meta / meta-llama-3-70b-instruct
A 70 billion parameter language model from Meta, fine tuned for chat completions
Updated 1 year, 5 months ago

meta / meta-llama-3-8b-instruct
An 8 billion parameter language model from Meta, fine tuned for chat completions
Updated 1 year, 5 months ago

meta / meta-llama-3-8b
Base version of Llama 3, an 8 billion parameter language model from Meta.
Updated 1 year, 5 months ago

falcons-ai / nsfw_image_detection
Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification
Updated 1 year, 9 months ago

meta / llama-2-7b-chat
A 7 billion parameter language model from Meta, fine tuned for chat completions
Updated 1 year, 10 months ago

mistralai / mistral-7b-v0.1
A 7 billion parameter language model from Mistral.
Updated 1 year, 11 months ago

meta / llama-2-70b-chat
A 70 billion parameter language model from Meta, fine tuned for chat completions
Updated 2 years ago

meta / llama-2-70b
Base version of Llama 2, a 70 billion parameter language model from Meta.
Updated 2 years ago

meta / llama-2-13b-chat
A 13 billion parameter language model from Meta, fine tuned for chat completions
Updated 2 years ago

meta / llama-2-13b
Base version of Llama 2 13B, a 13 billion parameter language model
Updated 2 years ago

meta / llama-2-7b
Base version of Llama 2 7B, a 7 billion parameter language model
Updated 2 years ago