You're looking at a specific version of this model. Jump to the model overview.

moinnadeem /mlc_llama_70b:cb441f71

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
prompt
string
Can you write a poem about open source machine learning? Let's make it in the style of E. E. Cummings.
Prompt to send to Llama v2.
system_prompt
string
"[INST] <<SYS>> You are a helpful, respectful and honest assistant. " "Always answer as helpfully as possible, while being safe. " "Your answers should not include any harmful, unethical, racist, sexist, toxic, dangerous, " "or illegal content. " "Please ensure that your responses are socially unbiased and positive in nature. " "If a question does not make any sense, or is not factually coherent, explain why instead " "of answering something not correct. " "If you don't know the answer to a question, please don't share false " "information. <</SYS>> "
System prompt to send to Llama v2. This is prepended to the prompt and helps guide system behavior.
max_new_tokens
integer
500

Min: 1

Maximum number of tokens to generate. A word is generally 2-3 tokens
temperature
number
0.95

Min: 0.01

Max: 5

Adjusts randomness of outputs, greater than 1 is random and 0 is deterministic, 0.75 is a good starting value.
top_p
number
0.95

Max: 1

When decoding text, samples from the top p percentage of most likely tokens; lower to ignore less likely tokens
repetition_penalty
number
1.15

Min: 0.01

Max: 5

Penalty for repeated words in generated text; 1 is no penalty, values greater than 1 discourage repetition, less than 1 encourage it.
stop_str
string
A sequence to stop generation at. For example, '<end>' will stop generation at the first instance of '<end>'.

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'items': {'type': 'string'},
 'title': 'Output',
 'type': 'array',
 'x-cog-array-display': 'concatenate',
 'x-cog-array-type': 'iterator'}