0xsmw/workgpt:60050bf6 | Run with an API on Replicate

You're looking at a specific version of this model. Jump to the model overview.

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field	Type	Default value	Description
prompt	string		Prompt to send to LLaMA.
n	integer	1 Min: 1 Max: 5	Number of output sequences to generate
total_tokens	integer	2000 Min: 1	Maximum number of tokens for input + generation. A word is generally 2-3 tokens
temperature	number	0.75 Min: 0.01 Max: 5	Adjusts randomness of outputs, greater than 1 is random and 0 is deterministic, 0.75 is a good starting value.
top_p	number	1 Min: 0.01 Max: 1	When decoding text, samples from the top p percentage of most likely tokens; lower to ignore less likely tokens
repetition_penalty	number	1 Min: 0.01 Max: 5	Penalty for repeated words in generated text; 1 is no penalty, values greater than 1 discourage repetition, less than 1 encourage it.

The shape of the response you’ll get when you run this model with an API.

Schema

{'items': {'type': 'string'}, 'title': 'Output', 'type': 'array'}