Official

anthropic / claude-3.5-sonnet

Anthropic's most intelligent language model to date, with a 200K token context window and image understanding (claude-3-5-sonnet-20241022)

  • Public
  • 259.2K runs
  • Priced per token
  • Commercial use
  • License

Input

  • Input prompt (string, required)
  • Optional input image (file). Images are priced as (width px * height px)/750 input tokens.
  • System prompt (string). Default: ""
  • Maximum number of output tokens (integer, minimum: 1, maximum: 8192). Default: 8192
  • Maximum image resolution in megapixels (number, minimum: 0.001, maximum: 2). Scales down the image before sending it to Claude, to save time and money. Default: 0.5
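
The image pricing rule and the megapixel cap can be combined into a rough token estimate. The sketch below is illustrative only: the function names are hypothetical, and it assumes the downscaling preserves aspect ratio, leaves images under the cap untouched, and that fractional tokens round up. None of those details are stated on this page.

```python
import math

# From the input description: images cost (width px * height px) / 750 input tokens.
PIXELS_PER_TOKEN = 750


def scaled_size(width, height, max_megapixels=0.5):
    """Return (width, height) after shrinking to at most max_megapixels.

    Assumption: aspect ratio is preserved and smaller images are not resized.
    """
    max_pixels = max_megapixels * 1_000_000
    pixels = width * height
    if pixels <= max_pixels:
        return width, height
    # Scale both sides by the same factor so the area fits under the cap.
    factor = math.sqrt(max_pixels / pixels)
    return max(1, math.floor(width * factor)), max(1, math.floor(height * factor))


def estimate_image_tokens(width, height, max_megapixels=0.5):
    """Estimate input tokens for an image after the (assumed) downscaling."""
    w, h = scaled_size(width, height, max_megapixels)
    # Assumption: partial tokens are rounded up.
    return math.ceil(w * h / PIXELS_PER_TOKEN)


if __name__ == "__main__":
    print(scaled_size(4000, 3000))
    print(estimate_image_tokens(4000, 3000))
```

With the default cap of 0.5 megapixels, any large image ends up costing at most about 500,000 / 750 ≈ 667 input tokens under these assumptions.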

Output

Here's a humorous yet enormous pancake recipe scaled up for California's population (approximately 40 million people):

"California-Sized Pancake Recipe"

Ingredients:
* 20 million cups all-purpose flour (about 2,500 tons)
* 40 million eggs (need about 3.3 million chickens working overtime)
* 15 million cups milk (about 240,000 cows' daily production)
* 10 million tablespoons baking powder
* 5 million teaspoons salt
* 5 million cups sugar
* 5 million cups melted butter

You'll need:
* The entire Central Valley as a mixing bowl
* Several helicopters with giant whisks
* A tectonic plate-sized griddle
* The Pacific Ocean converted to maple syrup

Instructions:
1. Mix dry ingredients in Death Valley
2. Combine wet ingredients in Lake Tahoe
3. Pour batter using the California Aqueduct
4. Cook on the San Andreas Fault line for even heat distribution
5. Flip using coordinated effort from entire US Air Force

Warning: May cause minor earthquakes when flipping. Not responsible for syrup-related flooding.

Serves: 40 million Californians, plus leftovers for Nevada

Note: This is obviously a silly recipe - it would be logistically impossible to make and cook a pancake this large! 😄

Input tokens: 21
Output tokens: 320
Tokens per second: 54.01

Pricing

Official model
Pricing for official models works differently from other models. Instead of being billed by time, you’re billed by input and output, making pricing more predictable.

This language model is priced by the number of input tokens sent and the number of output tokens generated.

Type      Per unit              Per $1
Input     $3.75 / 1M tokens     260K tokens / $1
Output    $18.75 / 1M tokens    53K tokens / $1

For example, for $10 you can run around 762 predictions where the input is a sentence or two (15 tokens) and the output is a few paragraphs (700 tokens).
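
The arithmetic behind that estimate can be sketched in a few lines. The rates come from the pricing table; the function names are hypothetical and exist only for this illustration.

```python
# Rates from the pricing table, in USD per one million tokens.
INPUT_USD_PER_MILLION = 3.75
OUTPUT_USD_PER_MILLION = 18.75


def prediction_cost(input_tokens, output_tokens):
    """Cost in USD of one prediction with the given token counts."""
    total = input_tokens * INPUT_USD_PER_MILLION + output_tokens * OUTPUT_USD_PER_MILLION
    return total / 1_000_000


def predictions_for_budget(budget_usd, input_tokens, output_tokens):
    """Number of whole predictions that fit in a budget."""
    return int(budget_usd // prediction_cost(input_tokens, output_tokens))


if __name__ == "__main__":
    # The example from the text: 15 input tokens and 700 output tokens, $10 budget.
    print(prediction_cost(15, 700))
    print(predictions_for_budget(10, 15, 700))
```

Output tokens dominate the cost here: at these rates one output token costs five times as much as one input token, so the result lands near 760 predictions whether or not the 15 input tokens are counted.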

Check out our docs for more information about how per-token pricing works on Replicate.

Readme

Claude 3.5 Sonnet

Claude 3.5 Sonnet is Anthropic’s latest frontier large language model, offering state-of-the-art performance with significantly improved speed and cost-effectiveness compared to previous models.

This Replicate model is built on claude-3-5-sonnet-20241022 and is served through the Anthropic and Google Cloud Vertex AI APIs.

Key features

  • Best-in-class performance on graduate-level reasoning (GPQA), undergraduate knowledge (MMLU), and coding (HumanEval)
  • 2x faster inference speed compared to Claude 3 Opus
  • 200K token context window
  • Advanced vision capabilities for interpreting images, charts, and graphics
  • Exceptional natural language understanding and generation
  • Industry-leading safety and privacy protections

Use cases

  • Complex customer support
  • Multi-step workflow orchestration
  • Code generation and debugging
  • Legacy code migration
  • Visual analysis and interpretation
  • Content generation with natural tone
  • Academic and research assistance

Limitations

  • Model does not learn from user interactions
  • No persistent memory between sessions
  • Subject to hallucinations and potential biases
  • Cannot access real-time information
  • Limited to text and image inputs

Safety

  • Rigorous safety testing and red teaming conducted
  • Evaluated by UK and US AI Safety Institutes
  • Enhanced safety mechanisms based on expert feedback
  • No training on user data without explicit consent
  • Built-in safeguards against misuse

Getting started

To use Claude 3.5 Sonnet via Replicate:

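# Requires the Replicate Python client (pip install replicate) and a
# REPLICATE_API_TOKEN environment variable.
import replicate

output = replicate.run(
    "anthropic/claude-3.5-sonnet",
    input={
        "prompt": "Your prompt here"
    }
)

# Language models typically return the text in chunks; join them before printing.
print("".join(output))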
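
The optional inputs listed in the Input section can be passed in the same dictionary. The sketch below validates values against the documented ranges before calling the model. Only the "prompt" key is confirmed by the example above; the other key names (system_prompt, max_tokens, max_image_resolution, image) and both helper functions are assumptions made for illustration, so check the model's API schema on Replicate before relying on them.

```python
def build_input(prompt, system_prompt="", max_tokens=8192,
                max_image_resolution=0.5, image=None):
    """Build the input dictionary, enforcing the documented ranges.

    Key names other than "prompt" are assumed, not taken from this page.
    """
    if not prompt:
        raise ValueError("prompt is required")
    if not 1 <= max_tokens <= 8192:
        raise ValueError("max_tokens must be between 1 and 8192")
    if not 0.001 <= max_image_resolution <= 2:
        raise ValueError("max_image_resolution must be between 0.001 and 2")

    payload = {
        "prompt": prompt,
        "system_prompt": system_prompt,
        "max_tokens": max_tokens,
        "max_image_resolution": max_image_resolution,
    }
    if image is not None:
        # Typically an open file handle or a URL string.
        payload["image"] = image
    return payload


def run_claude(**kwargs):
    """Call the model with validated inputs and return the joined text."""
    # Imported lazily so build_input works without the client installed.
    import replicate

    output = replicate.run("anthropic/claude-3.5-sonnet", input=build_input(**kwargs))
    return "".join(output)
```

A call might look like run_claude(prompt="Describe this chart", system_prompt="Be concise", max_tokens=512). Validating locally avoids spending a request on values the model would reject.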

Privacy policy & license

Data from this model is sent to Anthropic and Google Cloud Vertex AI.

Usage of this model is subject to Anthropic’s terms of service. Please refer to their website for full terms and conditions.