meta/llama-2-7b – Run with an API on Replicate

Official

meta / llama-2-7b

Base version of Llama 2 7B, a 7 billion parameter language model

Cold

Public
651.5K runs
Priced per token

Iterate in playground

Run with an API

Playground API Examples README

Input

prompt

*string

Shift + Return to add a new line

Prompt to send to the model.

max_tokens

integer

(minimum: 1)

Maximum number of tokens to generate. A word is generally 2-3 tokens.

Default: 512

min_tokens

integer

(minimum: -1)

Minimum number of tokens to generate. To disable, set to -1. A word is generally 2-3 tokens.

temperature

number

(minimum: 0, maximum: 5)

Adjusts randomness of outputs, greater than 1 is random and 0 is deterministic, 0.75 is a good starting value.

Default: 0.7

top_p

number

(minimum: 0, maximum: 1)

When decoding text, samples from the top p percentage of most likely tokens; lower to ignore less likely tokens.

Default: 0.95

top_k

integer

(minimum: -1)

When decoding text, samples from the top k most likely tokens; lower to ignore less likely tokens.

Default: 0

stop_sequences

string

Shift + Return to add a new line

A comma-separated list of sequences to stop generation at. For example, '<end>,<stop>' will stop generation at the first instance of 'end' or '<stop>'.

length_penalty

number

(minimum: 0, maximum: 5)

A parameter that controls how long the outputs are. If < 1, the model will tend to generate shorter outputs, and > 1 will tend to generate longer outputs.

Default: 1

presence_penalty

number

A parameter that penalizes repeated tokens regardless of the number of appearances. As the value increases, the model will be less likely to repeat tokens in the output.

Default: 0

seed

integer

Random seed. Leave blank to randomize the seed.

prompt_template

string

Shift + Return to add a new line

Template for formatting the prompt. Can be an arbitrary string, but must contain the substring `{prompt}`.

Default: "{prompt}"

log_performance_metrics

boolean

Default: false

max_new_tokens

integer

(minimum: 1)

This parameter has been renamed to max_tokens. max_new_tokens only exists for backwards compatibility purposes. We recommend you use max_tokens instead. Both may not be specified.

min_new_tokens

integer

(minimum: -1)

This parameter has been renamed to min_tokens. min_new_tokens only exists for backwards compatibility purposes. We recommend you use min_tokens instead. Both may not be specified.

Run this model in Node.js with one line of code:

npx create-replicate --model=meta/llama-2-7b

or set up a project from scratch

Install Replicate’s Node.js client library:

npm install replicate

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Import and set up the client:

import Replicate from "replicate";

const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});

Run meta/llama-2-7b using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

const input = {
  top_k: 250,
  top_p: 0.95,
  prompt: "A llama walks into a bar",
  max_tokens: 512,
  temperature: 0.95,
  length_penalty: 1,
  max_new_tokens: 500,
  min_new_tokens: -1,
  prompt_template: "{prompt}",
  presence_penalty: 0,
  log_performance_metrics: false
};

for await (const event of replicate.stream("meta/llama-2-7b", { input })) {
  process.stdout.write(event.toString());
};

To learn more, take a look at the guide on getting started with Node.js.

Install Replicate’s Python client library:

pip install replicate

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Import the client:

import replicate

Run meta/llama-2-7b using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

# The meta/llama-2-7b model can stream output as it's running.
for event in replicate.stream(
    "meta/llama-2-7b",
    input={
        "top_k": 250,
        "top_p": 0.95,
        "prompt": "A llama walks into a bar",
        "max_tokens": 512,
        "temperature": 0.95,
        "length_penalty": 1,
        "max_new_tokens": 500,
        "min_new_tokens": -1,
        "prompt_template": "{prompt}",
        "presence_penalty": 0,
        "log_performance_metrics": False
    },
):
    print(str(event), end="")

To learn more, take a look at the guide on getting started with Python.

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Run meta/llama-2-7b using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "input": {
      "top_k": 250,
      "top_p": 0.95,
      "prompt": "A llama walks into a bar",
      "max_tokens": 512,
      "temperature": 0.95,
      "length_penalty": 1,
      "max_new_tokens": 500,
      "min_new_tokens": -1,
      "prompt_template": "{prompt}",
      "presence_penalty": 0,
      "log_performance_metrics": false
    }
  }' \
  https://api.replicate.com/v1/models/meta/llama-2-7b/predictions

To learn more, take a look at Replicate’s HTTP API reference docs.

Output

, he orders a martini. everyone in the place stops and stares at him," said Jude Lawless: In 1983 he moved to New York City to pursue his performing career; there in November of that year he opened for Paul Simon on tour with Kate Pierson & The B-Sides (formerly of The B-52's). He also appeared with artists such as Miles Davis and Lou Reed who taught him how to write music. In February, 1984 they won an ABC-TV New Day competition contest and released their first single "That's All Right" on Scorpio Records which became so successful that it went Gold in Canada. His latest album "Tales Told By A Dead Man" has received top reviews and is being praised by many critics since its release in September, 2007. Since then, Pansy started singing in local bands including several funk groups where he began learning how to play drums with a backyard bamboo set given to him from his grandmother Mary Beth. As time passed on, Mike developed an interest towards punk rock which sparked him later to form a band called Broken Arrow (Bass/Vocals) originally started in South Los Angeles before playing clubs around Hollywood such as The Roxy Theater or The Whiskey Showroom where Bruce Springsteen was often seen patronizing after his shows according to rumors told by fans themselves! Meanwhile at home while rehearsing one night outside the house listening to The Police and The Who's "Wonka", Mike discovered something about himself...that what felt right seemed wrong because all these great songs sound good no matter who sings them even though sometimes certain voices were better suited than others like Billy Idol when covering David Lee Roth era Van Halen material on TV show or Dweezil Zappa on radio stations too much. This thought process made sense until some day when doing karaoke came up during which his friends said oh yeah dude we need you tonight because we have not heard anyone else sing them yet!. Right? So why ask me if I want another drink huh. He will say thank u, here ya go.. Jake Johnson who is also an actor can be seen recurring roles of Mr. Dexter as the owner for Dog & Scout was chosen specifically

{
  "completed_at": "2023-08-22T16:53:10.779773Z",
  "created_at": "2023-08-22T16:52:05.635899Z",
  "data_removed": false,
  "error": null,
  "id": "zhgtfylb6wygeves7nvf36ar4y",
  "input": {
    "debug": false,
    "top_k": 250,
    "top_p": 0.95,
    "prompt": "A llama walks into a bar",
    "temperature": 0.95,
    "max_new_tokens": 500,
    "min_new_tokens": -1,
    "repetition_penalty": 1.15,
    "repetition_penalty_sustain": 256,
    "token_repetition_penalty_decay": 128
  },
  "logs": "** Speed: 73.21 tokens/second",
  "metrics": {
    "predict_time": 16.854758,
    "total_time": 65.143874
  },
  "output": [
    ",",
    " he",
    " orders",
    " a",
    " mart",
    "ini",
    ".",
    " everyone",
    " in",
    " the",
    " place",
    " stops",
    " and",
    " st",
    "ares",
    " at",
    " him",
    ",\"",
    " said",
    " J",
    "ude",
    " Law",
    "less",
    ":",
    " In",
    " ",
    "1",
    "9",
    "8",
    "3",
    " he",
    " moved",
    " to",
    " New",
    " York",
    " City",
    " to",
    " purs",
    "ue",
    " his",
    " performing",
    " career",
    ";",
    " there",
    " in",
    " November",
    " of",
    " that",
    " year",
    " he",
    " opened",
    " for",
    " Paul",
    " Simon",
    " on",
    " tour",
    " with",
    " Kate",
    " Pi",
    "erson",
    " &",
    " The",
    " B",
    "-",
    "S",
    "ides",
    " (",
    "former",
    "ly",
    " of",
    " The",
    " B",
    "-",
    "5",
    "2",
    "'",
    "s",
    ").",
    " He",
    " also",
    " appeared",
    " with",
    " artists",
    " such",
    " as",
    " Mil",
    "es",
    " Davis",
    " and",
    " Lou",
    " Re",
    "ed",
    " who",
    " taught",
    " him",
    " how",
    " to",
    " write",
    " music",
    ".",
    " In",
    " February",
    ",",
    " ",
    "1",
    "9",
    "8",
    "4",
    " they",
    " won",
    " an",
    " ABC",
    "-",
    "TV",
    " New",
    " Day",
    " competition",
    " contest",
    " and",
    " released",
    " their",
    " first",
    " single",
    " \"",
    "That",
    "'",
    "s",
    " All",
    " Right",
    "\"",
    " on",
    " Sc",
    "or",
    "pio",
    " Records",
    " which",
    " became",
    " so",
    " successful",
    " that",
    " it",
    " went",
    " Gold",
    " in",
    " Canada",
    ".",
    " His",
    " latest",
    " album",
    " \"",
    "T",
    "ales",
    " T",
    "old",
    " By",
    " A",
    " Dead",
    " Man",
    "\"",
    " has",
    " received",
    " top",
    " reviews",
    " and",
    " is",
    " being",
    " pra",
    "ised",
    " by",
    " many",
    " critics",
    " since",
    " its",
    " release",
    " in",
    " September",
    ",",
    " ",
    "2",
    "0",
    "0",
    "7",
    ".",
    "\n",
    "Since",
    " then",
    ",",
    " P",
    "ans",
    "y",
    " started",
    " singing",
    " in",
    " local",
    " bands",
    " including",
    " several",
    " fun",
    "k",
    " groups",
    " where",
    " he",
    " began",
    " learning",
    " how",
    " to",
    " play",
    " drums",
    " with",
    " a",
    " back",
    "yard",
    " b",
    "am",
    "bo",
    "o",
    " set",
    " given",
    " to",
    " him",
    " from",
    " his",
    " grand",
    "m",
    "other",
    " Mary",
    " Beth",
    ".",
    " As",
    " time",
    " passed",
    " on",
    ",",
    " Mike",
    " developed",
    " an",
    " interest",
    " towards",
    " punk",
    " rock",
    " which",
    " spark",
    "ed",
    " him",
    " later",
    " to",
    " form",
    " a",
    " band",
    " called",
    " Bro",
    "ken",
    " Ar",
    "row",
    " (",
    "B",
    "ass",
    "/",
    "V",
    "oc",
    "als",
    ")",
    " originally",
    " started",
    " in",
    " South",
    " Los",
    " Angeles",
    " before",
    " playing",
    " clubs",
    " around",
    " Hollywood",
    " such",
    " as",
    " The",
    " Ro",
    "xy",
    " Theater",
    " or",
    " The",
    " Wh",
    "is",
    "key",
    " Show",
    "room",
    " where",
    " Bruce",
    " Spring",
    "ste",
    "en",
    " was",
    " often",
    " seen",
    " patron",
    "izing",
    " after",
    " his",
    " shows",
    " according",
    " to",
    " rum",
    "ors",
    " told",
    " by",
    " fans",
    " themselves",
    "!",
    " Meanwhile",
    " at",
    " home",
    " while",
    " re",
    "he",
    "ars",
    "ing",
    " one",
    " night",
    " outside",
    " the",
    " house",
    " listening",
    " to",
    " The",
    " Police",
    " and",
    " The",
    " Who",
    "'",
    "s",
    " \"",
    "W",
    "on",
    "ka",
    "\",",
    " Mike",
    " discovered",
    " something",
    " about",
    " himself",
    "...",
    "that",
    " what",
    " felt",
    " right",
    " seemed",
    " wrong",
    " because",
    " all",
    " these",
    " great",
    " songs",
    " sound",
    " good",
    " no",
    " matter",
    " who",
    " s",
    "ings",
    " them",
    " even",
    " though",
    " sometimes",
    " certain",
    " voices",
    " were",
    " better",
    " su",
    "ited",
    " than",
    " others",
    " like",
    " Billy",
    " Id",
    "ol",
    " when",
    " covering",
    " David",
    " Lee",
    " Roth",
    " era",
    " Van",
    " Hal",
    "en",
    " material",
    " on",
    " TV",
    " show",
    " or",
    " D",
    "we",
    "ez",
    "il",
    " Z",
    "appa",
    " on",
    " radio",
    " stations",
    " too",
    " much",
    ".",
    " This",
    " thought",
    " process",
    " made",
    " sense",
    " until",
    " some",
    " day",
    " when",
    " doing",
    " k",
    "ara",
    "oke",
    " came",
    " up",
    " during",
    " which",
    " his",
    " friends",
    " said",
    " oh",
    " yeah",
    " du",
    "de",
    " we",
    " need",
    " you",
    " ton",
    "ight",
    " because",
    " we",
    " have",
    " not",
    " heard",
    " anyone",
    " else",
    " sing",
    " them",
    " yet",
    "!.",
    " Right",
    "?",
    "\n",
    "So",
    " why",
    " ask",
    " me",
    " if",
    " I",
    " want",
    " another",
    " drink",
    " h",
    "uh",
    ".",
    " He",
    " will",
    " say",
    " thank",
    " u",
    ",",
    " here",
    " ya",
    " go",
    "..",
    "\n",
    "J",
    "ake",
    " Johnson",
    " who",
    " is",
    " also",
    " an",
    " actor",
    " can",
    " be",
    " seen",
    " rec",
    "urr",
    "ing",
    " roles",
    " of",
    " Mr",
    ".",
    "\n",
    "D",
    "ex",
    "ter",
    " as",
    " the",
    " owner",
    " for",
    " Dog",
    " &",
    " Sc",
    "out",
    " was",
    " chosen",
    " specifically"
  ],
  "started_at": "2023-08-22T16:52:53.925015Z",
  "status": "succeeded",
  "urls": {
    "get": "https://api.replicate.com/v1/predictions/zhgtfylb6wygeves7nvf36ar4y",
    "cancel": "https://api.replicate.com/v1/predictions/zhgtfylb6wygeves7nvf36ar4y/cancel"
  },
  "version": "acdbe5a4987a29261ba7d7d4195ad4fa6b62ce27b034f989fcb9ab0421408a7c"
}

Generated in

16.9 seconds

Tweak it Report

Pricing

This model is priced by how many input tokens are sent and how many output tokens are generated.

Type	Per unit	Per $1
Input	$0.05 / 1M tokens or 20M tokens / $1
Output	$0.25 / 1M tokens or 4M tokens / $1

For example, for $10 you can run around 57,143 predictions where the input is a sentence or two (15 tokens) and the output is a few paragraphs (700 tokens).

Check out our docs for more information about how per-token pricing works on Replicate.

Readme

Llama 2 is a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. This is the repository for the 7 billion parameter base model, which has not been fine-tuned.

Learn more about running Llama 2 with an API and the different models.

Please see ai.meta.com/llama for more information about the model, licensing, and acceptable use.