Use official models

Official models are always on, maintained, and have predictable pricing.

Recommended models

anthropic / claude-3.5-haiku

Anthropic's fastest, most cost-effective model, with a 200K token context window (claude-3-5-haiku-20241022)

1.7K runs

anthropic / claude-3.5-sonnet

Anthropic's most intelligent language model to date, with a 200K token context window and image understanding (claude-3-5-sonnet-20241022)

1.5K runs

minimax / video-01-director

Generate videos with specific camera movements

729 runs

google / imagen-3-fast

A faster and cheaper Imagen 3 model, for when price or speed are more important than final image quality

12.6K runs

google / imagen-3

Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty

38.3K runs

black-forest-labs / flux-depth-pro

Professional depth-aware image generation. Edit images while preserving spatial relationships.

52.5K runs

black-forest-labs / flux-canny-pro

Professional edge-guided image generation. Control structure and composition using Canny edge detection

83.2K runs

black-forest-labs / flux-fill-pro

Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, seamless results.

200.9K runs

black-forest-labs / flux-1.1-pro

Faster, better FLUX Pro. Text-to-image model with excellent image quality, prompt adherence, and output diversity.

15M runs

black-forest-labs / flux-1.1-pro-ultra

FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.

5.7M runs

black-forest-labs / flux-pro

State-of-the-art image generation with top-of-the-line prompt following, visual quality, image detail, and output diversity.

9.2M runs

black-forest-labs / flux-schnell

The fastest image generation model tailored for local development and personal use

219.8M runs

black-forest-labs / flux-fill-dev

Open-weight inpainting model for editing and extending images. Guidance-distilled from FLUX.1 Fill [pro].

103.3K runs

deepseek-ai / deepseek-r1

A reasoning model trained with reinforcement learning, on par with OpenAI o1

250.1K runs

luma / ray-2-720p

Generate 5s and 9s 720p videos

1.4K runs

luma / ray-2-540p

Generate 5s and 9s 540p videos

512 runs

minimax / video-01

Generate 6s videos with prompts or images (also known as Hailuo). Use a subject reference to make a video with a character and the S2V-01 model.

205.7K runs

recraft-ai / recraft-creative-upscale

Creative Upscale focuses on enhancing details and refining complex elements in the image. It doesn’t just increase resolution but adds depth by improving textures, fine details, and facial features.

940 runs

recraft-ai / recraft-crisp-upscale

Designed to make images sharper and cleaner, Crisp Upscale increases overall quality, making visuals suitable for web use or print-ready materials.

5.4K runs

playht / play-dialog

End-to-end AI speech model designed for natural-sounding conversational speech synthesis, with support for context-aware prosody, intonation, and emotional expression.

3.6K runs

kwaivgi / kling-v1.6-standard

Generate 5s and 10s videos in 720p resolution

41.7K runs

kwaivgi / kling-v1.6-pro

Generate 5s and 10s videos in 1080p resolution

32.1K runs

black-forest-labs / flux-dev

A 12 billion parameter rectified flow transformer capable of generating images from text descriptions

11.2M runs

ibm-granite / granite-3.1-8b-instruct

Granite-3.1-8B-Instruct is a lightweight, open-source 8B parameter model designed to excel in instruction-following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

431K runs

ibm-granite / granite-3.1-2b-instruct

Granite-3.1-2B-Instruct is a lightweight and open-source 2B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

8.2K runs

minimax / music-01

Quickly generate up to 1 minute of music with lyrics and vocals in the style of a reference track

45.7K runs

minimax / video-01-live

An image-to-video (I2V) model specifically trained for Live2D and general animation use cases

56.4K runs

luma / ray

Fast, high-quality text-to-video and image-to-video (also known as Dream Machine)

12.2K runs

recraft-ai / recraft-20b

Affordable and fast images

57.6K runs

recraft-ai / recraft-20b-svg

Affordable and fast vector images

7.8K runs

luma / photon-flash

Accelerated variant of Photon prioritizing speed while maintaining quality

38.1K runs

luma / photon

High-quality image generation model optimized for creative professional workflows and ultra-high fidelity outputs

179.1K runs

haiper-ai / haiper-video-2

Generate 4s and 6s videos from a prompt or image

6K runs

black-forest-labs / flux-dev-lora

A version of flux-dev, a text-to-image model, that supports fast inference with fine-tuned LoRAs

246.9K runs

black-forest-labs / flux-schnell-lora

The fastest image generation model tailored for fine-tuned use

399.9K runs

black-forest-labs / flux-depth-dev

Open-weight depth-aware image generation. Edit images while preserving spatial relationships.

57.9K runs

black-forest-labs / flux-canny-dev

Open-weight edge-guided image generation. Control structure and composition using Canny edge detection.

38.1K runs

black-forest-labs / flux-redux-schnell

Fast, efficient image variation model for rapid iteration and experimentation.

18.4K runs

black-forest-labs / flux-redux-dev

Open-weight image variation model. Create new versions while preserving key elements of your original.

41.4K runs

recraft-ai / recraft-v3-svg

Recraft V3 SVG (code-named red_panda) is a text-to-image model that can generate high-quality SVG images, including logotypes and icons. The model supports a wide range of styles.

59.6K runs

recraft-ai / recraft-v3

Recraft V3 (code-named red_panda) is a text-to-image model that can generate long texts and images in a wide range of styles. As of today, it is state of the art (SOTA) in image generation, according to the Text-to-Image Benchmark by Artificial Analysis.

1.1M runs

stability-ai / stable-diffusion-3.5-medium

2.5 billion parameter image model with improved MMDiT-X architecture

17K runs

stability-ai / stable-diffusion-3.5-large-turbo

A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, with a focus on fewer inference steps

70.1K runs

stability-ai / stable-diffusion-3.5-large

A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, thanks to Query-Key Normalization.

472.4K runs

ideogram-ai / ideogram-v2-turbo

A fast image model with state-of-the-art inpainting, prompt comprehension, and text rendering.

738K runs

ideogram-ai / ideogram-v2

An excellent image model with state-of-the-art inpainting, prompt comprehension, and text rendering

466.1K runs

ibm-granite / granite-3.0-8b-instruct

Granite-3.0-8B-Instruct is a lightweight, open-source 8B parameter model designed to excel in instruction-following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

181.4K runs

ibm-granite / granite-3.0-2b-instruct

Granite-3.0-2B-Instruct is a lightweight and open-source 2B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

420.3K runs

ibm-granite / granite-8b-code-instruct-128k

Join the Granite community where you can find numerous recipe workbooks to help you get started with a wide variety of use cases using this model. https://github.com/ibm-granite-community

533.7K runs

ibm-granite / granite-20b-code-instruct-8k

Join the Granite community where you can find numerous recipe workbooks to help you get started with a wide variety of use cases using this model. https://github.com/ibm-granite-community

110K runs

meta / meta-llama-3.1-405b-instruct

Meta's flagship 405 billion parameter language model, fine-tuned for chat completions

4.8M runs

stability-ai / stable-diffusion-3

A text-to-image model with greatly improved performance in image quality, typography, complex prompt understanding, and resource-efficiency

1.6M runs

meta / meta-llama-3-70b

Base version of Llama 3, a 70 billion parameter language model from Meta.

817.9K runs

meta / meta-llama-3-70b-instruct

A 70 billion parameter language model from Meta, fine-tuned for chat completions

142.9M runs

meta / meta-llama-3-8b-instruct

An 8 billion parameter language model from Meta, fine-tuned for chat completions

310.5M runs

meta / meta-llama-3-8b

Base version of Llama 3, an 8 billion parameter language model from Meta.

50.8M runs

meta / llama-2-7b-chat

A 7 billion parameter language model from Meta, fine-tuned for chat completions

17.1M runs

mistralai / mistral-7b-v0.1

A 7 billion parameter language model from Mistral.

1.9M runs

meta / llama-2-70b-chat

A 70 billion parameter language model from Meta, fine-tuned for chat completions

9.1M runs

meta / llama-2-70b

Base version of Llama 2, a 70 billion parameter language model from Meta.

351.8K runs

meta / llama-2-13b-chat

A 13 billion parameter language model from Meta, fine-tuned for chat completions

4.8M runs

meta / llama-2-13b

Base version of Llama 2 13B, a 13 billion parameter language model

201.6K runs

meta / llama-2-7b

Base version of Llama 2 7B, a 7 billion parameter language model

650.9K runs