• Public
  • 1.2K runs
  • L40S
Iterate in playground

Input

*string
Shift + Return to add a new line

Text prompt to send to the model.

integer
(minimum: 1, maximum: 5)

Number of output sequences to generate

Default: 1

integer
(minimum: 1)

Maximum number of tokens to generate. A word is generally 2-3 tokens

Default: 50

number
(minimum: 0.01, maximum: 5)

Adjusts randomness of outputs, greater than 1 is random and 0 is deterministic, 0.75 is a good starting value.

Default: 0.75

number
(minimum: 0.01, maximum: 1)

When decoding text, samples from the top p percentage of most likely tokens; lower to ignore less likely tokens

Default: 1

number
(minimum: 0.01, maximum: 5)

Penalty for repeated words in generated text; 1 is no penalty, values greater than 1 discourage repetition, less than 1 encourage it.

Default: 1

integer
(minimum: 1)

计算每个可能的下一个 token 的概率分布

Default: 100

Output

你好呀!我叫李红,是中国上海的。上海是中国最大的城市,人口有14亿人。这里有很多现代化的建筑,上海有很多世界级的博物馆,例如上海博物馆。上海还非常擅
Generated in

Run time and cost

This model costs approximately $0.010 to run on Replicate, or 100 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 11 seconds.

Readme

This model doesn't have a readme.