hamelsmu
/
test-mistral-7b-instruct-v0.2
- Public
- 5 runs
Run hamelsmu/test-mistral-7b-instruct-v0.2 with an API
Use one of our client libraries to get started quickly. Clicking on a library will take you to the Playground tab where you can tweak different inputs, see the results, and copy the corresponding code to use in your own project.
Input schema
The fields you can use to run this model with an API. If you don't give a value for a field its default value will be used.
Field | Type | Default value | Description |
---|---|---|---|
prompt |
string
|
None
|
|
max_new_tokens |
integer
|
512
|
The maximum number of tokens the model should generate as output.
|
temperature |
number
|
0.6
|
The value used to modulate the next token probabilities.
|
top_p |
number
|
0.9
|
A probability threshold for generating the output. If < 1.0, only keep the top tokens with cumulative probability >= top_p (nucleus filtering). Nucleus filtering is described in Holtzman et al. (http://arxiv.org/abs/1904.09751).
|
top_k |
integer
|
50
|
The number of highest probability tokens to consider for generating the output. If > 0, only keep the top k tokens with highest probability (top-k filtering).
|
presence_penalty |
number
|
0
|
Presence penalty
|
frequency_penalty |
number
|
0
|
Frequency penalty
|
prompt_template |
string
|
<s>[INST] {prompt} [/INST]
|
The template used to format the prompt. The input prompt is inserted into the template using the `{prompt}` placeholder.
|
{
"type": "object",
"title": "Input",
"required": [
"prompt"
],
"properties": {
"top_k": {
"type": "integer",
"title": "Top K",
"default": 50,
"x-order": 4,
"description": "The number of highest probability tokens to consider for generating the output. If > 0, only keep the top k tokens with highest probability (top-k filtering)."
},
"top_p": {
"type": "number",
"title": "Top P",
"default": 0.9,
"x-order": 3,
"description": "A probability threshold for generating the output. If < 1.0, only keep the top tokens with cumulative probability >= top_p (nucleus filtering). Nucleus filtering is described in Holtzman et al. (http://arxiv.org/abs/1904.09751)."
},
"prompt": {
"type": "string",
"title": "Prompt",
"x-order": 0
},
"temperature": {
"type": "number",
"title": "Temperature",
"default": 0.6,
"x-order": 2,
"description": "The value used to modulate the next token probabilities."
},
"max_new_tokens": {
"type": "integer",
"title": "Max New Tokens",
"default": 512,
"x-order": 1,
"description": "The maximum number of tokens the model should generate as output."
},
"prompt_template": {
"type": "string",
"title": "Prompt Template",
"default": "<s>[INST] {prompt} [/INST] ",
"x-order": 7,
"description": "The template used to format the prompt. The input prompt is inserted into the template using the `{prompt}` placeholder."
},
"presence_penalty": {
"type": "number",
"title": "Presence Penalty",
"default": 0,
"x-order": 5,
"description": "Presence penalty"
},
"frequency_penalty": {
"type": "number",
"title": "Frequency Penalty",
"default": 0,
"x-order": 6,
"description": "Frequency penalty"
}
}
}
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{
"type": "array",
"items": {
"type": "string"
},
"title": "Output",
"x-cog-array-type": "iterator",
"x-cog-array-display": "concatenate"
}