zeke / zistral

An experimental copy of the Mistral LLM

  • Public
  • 3 runs
  • L40S

Input

*string
Shift + Return to add a new line
integer

The maximum number of tokens the model should generate as output.

Default: 256

number

The value used to modulate the next token probabilities.

Default: 1

number

A probability threshold for generating the output. If < 1.0, only keep the top tokens with cumulative probability >= top_p (nucleus filtering). Nucleus filtering is described in Holtzman et al. (http://arxiv.org/abs/1904.09751).

Default: 1

integer

The number of highest probability tokens to consider for generating the output. If > 0, only keep the top k tokens with highest probability (top-k filtering).

Default: 50

Output

No output yet! Press "Submit" to start a prediction.

Run time and cost

This model runs on Nvidia L40S GPU hardware. We don't yet have enough runs of this model to provide performance information.