nateraw / axolotl-trainer-llama-2-7b

  • Public
  • 34 runs
  • L40S

Input

*string
Shift + Return to add a new line
integer

The maximum number of tokens the model should generate as output.

Default: 512

number

The value used to modulate the next token probabilities.

Default: 0.7

boolean

Whether or not to use sampling; otherwise use greedy decoding.

Default: true

number

A probability threshold for generating the output. If < 1.0, only keep the top tokens with cumulative probability >= top_p (nucleus filtering). Nucleus filtering is described in Holtzman et al. (http://arxiv.org/abs/1904.09751).

Default: 0.95

integer

The number of highest probability tokens to consider for generating the output. If > 0, only keep the top k tokens with highest probability (top-k filtering).

Default: 50

string
Shift + Return to add a new line

The template used to format the prompt before passing it to the model. For no template, you can set this to `{prompt}`.

Default: "### System:\nBelow is an instruction that describes a task. Write a response that appropriately completes the request.\n\n### Instruction:\nTranslate the input from English to Hinglish\n\n### Input:\n{prompt}\n\n### Response:\n "

Output

No output yet! Press "Submit" to start a prediction.

Run time and cost

This model runs on Nvidia L40S GPU hardware. We don't yet have enough runs of this model to provide performance information.

Readme

This model doesn't have a readme.