Run johnnyoshika/llama2-combine-numbers using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run(
"johnnyoshika/llama2-combine-numbers:3d318c904899fa396a3255078da6a56c0d4f0b7837550159f196eb05932aae0a",
input={
"debug": False,
"top_p": 0.95,
"prompt": "What is 10+4?",
"temperature": 0.7,
"return_logits": False,
"max_new_tokens": 128,
"min_new_tokens": -1,
"repetition_penalty": 1.15
}
)
# The johnnyoshika/llama2-combine-numbers model can stream output as it's running.# The predict method returns an iterator, and you can iterate over that output.for item in output:
# https://replicate.com/johnnyoshika/llama2-combine-numbers/api#output-schemaprint(item, end="")