technillogue/good-staging-llama-2-7b

Public
26 runs

Input

Install Replicate’s Python client library:
pip install replicate
Set the REPLICATE_API_TOKEN environment variable:
export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.
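
Before importing the client, it can help to confirm that the token is actually visible to Python, since a missing variable otherwise only surfaces as an authentication error at request time. The following is a minimal sketch; the helper name `require_token` is hypothetical and is not part of the `replicate` library.

```python
import os


def require_token(env=os.environ):
    """Return REPLICATE_API_TOKEN from env, or raise if it is missing or blank.

    `require_token` is a hypothetical helper, not part of the replicate client.
    """
    token = env.get("REPLICATE_API_TOKEN", "").strip()
    if not token:
        raise RuntimeError(
            "REPLICATE_API_TOKEN is not set; export it before running the model."
        )
    return token
```

Calling `require_token()` with no arguments checks the real process environment; passing a plain dict, as in a test, avoids touching it.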

Import the client:
import replicate

Run technillogue/good-staging-llama-2-7b using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

output = replicate.run(
    "technillogue/good-staging-llama-2-7b:dd5b07e8f407e7abd2764e1673df98f260aa48ae531b821fb64199ea9c9228fa",
    input={
        "debug": False,
        "top_k": 50,
        "top_p": 0.9,
        "temperature": 0.75,
        "max_new_tokens": 128,
        "min_new_tokens": -1
    }
)

# The technillogue/good-staging-llama-2-7b model can stream output as it's running.
# The predict method returns an iterator, and you can iterate over that output.
for item in output:
    # https://replicate.com/technillogue/good-staging-llama-2-7b/api#output-schema
    print(item, end="")
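
If the full text is needed as a single string rather than printed piece by piece, the streamed items can be joined. This is a minimal sketch, assuming each item is a text chunk as in the loop above; `collect_output` is a hypothetical helper, and the stand-in generator `fake_stream` replaces a real `replicate.run` call so the snippet runs offline.

```python
def collect_output(chunks):
    """Join an iterable of streamed chunks into one string.

    Hypothetical helper, not part of the replicate client.
    """
    return "".join(str(chunk) for chunk in chunks)


def fake_stream():
    """Stand-in for the iterator returned by replicate.run (assumption)."""
    yield "Hello"
    yield ", "
    yield "world"


text = collect_output(fake_stream())
print(text)
```

With a real prediction, pass the value returned by `replicate.run(...)` in place of `fake_stream()`. An iterator can be consumed only once, so either print the items in a loop or collect them, not both.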

To learn more, take a look at the guide on getting started with Python.

Run time and cost

This model runs on Nvidia L40S GPU hardware. We don't yet have enough runs of this model to provide performance information.

Readme

This model doesn't have a readme.