tomasmcm / openinstruct-mistral-7b

Source: monology/openinstruct-mistral-7b ✦ Quant: TheBloke/openinstruct-mistral-7B-AWQ ✦ Commercially-usable 7B model, based on mistralai/Mistral-7B-v0.1 and finetuned on VMware/open-instruct

  • Public
  • 295 runs
  • L40S
  • Paper
  • License

Input

*string
Shift + Return to add a new line

Text prompt to send to the model.

integer

Maximum number of tokens to generate per output sequence.

Default: 128

number
(minimum: -5, maximum: 5)

Float that penalizes new tokens based on whether they appear in the generated text so far. Values > 0 encourage the model to use new tokens, while values < 0 encourage the model to repeat tokens.

Default: 0

number
(minimum: -5, maximum: 5)

Float that penalizes new tokens based on their frequency in the generated text so far. Values > 0 encourage the model to use new tokens, while values < 0 encourage the model to repeat tokens.

Default: 0

number
(minimum: 0.01, maximum: 5)

Float that controls the randomness of the sampling. Lower values make the model more deterministic, while higher values make the model more random. Zero means greedy sampling.

Default: 0.8

number
(minimum: 0.01, maximum: 1)

Float that controls the cumulative probability of the top tokens to consider. Must be in (0, 1]. Set to 1 to consider all tokens.

Default: 0.95

integer

Integer that controls the number of top tokens to consider. Set to -1 to consider all tokens.

Default: -1

string
Shift + Return to add a new line

List of strings that stop the generation when they are generated. The returned output will not contain the stop strings.

Output

Machine learning is a fascinating field That's for sure, I can't disagree With so many open source tools and libraries There's a model for every problem From linear regression to decision trees To support vector machines and deep neural nets There's no shortage of options, you can use Whatever model is best for your data set But with so many models out there It can be hard to choose which one to use So I recommend starting with the basics And working your way up to more advanced tools So go forth and explore the wonderful world
Generated in

Run time and cost

This model costs approximately $0.0020 to run on Replicate, or 500 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 3 seconds.

Readme

OpenInstruct Mistral-7B

1st among commercially-usable 7B models on the Open LLM Leaderboard!*

This is mistralai/Mistral-7B-v0.1 finetuned on VMware/open-instruct.
Quantized to FP16 and released under the Apache-2.0 license by myself.
Compute generously provided by Higgsfield AI.

Prompt format: Alpaca

Below is an instruction that describes a task. Write a response that appropriately completes the request.

### Instruction:
[your instruction goes here]

### Response:
  • temperature: 0.2
  • top_k: 50
  • top_p 0.95
  • repetition_penalty: 1.1

*as of 21 Nov 2023. “commercially-usable” includes both an open-source base model and a non-synthetic open-source finetune dataset. updated leaderboard results available here.