
zsxkib /dia:91a8c206

Input

string (required)

Input text for dialogue generation. Use [S1], [S2] to indicate different speakers and (description) in parentheses for non-verbal cues e.g., (laughs), (whispers).
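
The tagging convention above can be assembled programmatically. The sketch below is illustrative only: `format_dialogue` is a hypothetical helper, not part of the model, and joining turns with a single space is an assumption, since the page does not say how turns should be separated.

```python
def format_dialogue(turns):
    """Join (speaker_number, line) pairs into one tagged script string.

    Hypothetical helper: only the [S1]/[S2] tags and the parenthesised
    non-verbal cues come from the page; the single-space separator is an
    assumption.
    """
    parts = []
    for speaker, line in turns:
        if speaker not in (1, 2):
            raise ValueError("the page only describes [S1] and [S2]")
        parts.append(f"[S{speaker}] {line.strip()}")
    return " ".join(parts)


script = format_dialogue([
    (1, "Did you hear the news?"),
    (2, "I did. (laughs) I still can't believe it."),
])
print(script)
```

Non-verbal cues such as (laughs) are written inline in the spoken line, so the helper passes them through unchanged.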

number
(minimum: 1, maximum: 5)

Controls how closely the audio follows your text. Higher values (3-5) follow the text more strictly; lower values may sound more natural but deviate more from it.

Default: 3

number
(minimum: 0.1, maximum: 2)

Controls randomness in generation. Higher values (1.3-2.0) increase variety; lower values (0.1-0.9) make output more consistent and predictable.

Default: 1.3
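
To build intuition for the randomness control, here is a minimal sketch that assumes it behaves like a conventional sampling temperature applied to logits. The page does not state this, and the function name and example logits are made up for illustration.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn logits into probabilities after dividing by a temperature.

    Assumption: this mirrors a standard sampling temperature; it is not
    taken from the model's source code.
    """
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.0]  # made-up scores for three candidate tokens
low = softmax_with_temperature(logits, 0.5)
high = softmax_with_temperature(logits, 2.0)
print("temperature 0.5:", [round(p, 3) for p in low])
print("temperature 2.0:", [round(p, 3) for p in high])
```

Under that assumption, a lower value concentrates probability on the top candidate (more predictable output), while a higher value flattens the distribution (more variety).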

number
(minimum: 0.1, maximum: 1)

Controls diversity of word choice. Higher values include more unusual options. Most users shouldn't need to adjust this parameter.

Default: 0.95
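
The diversity control reads like nucleus (top-p) filtering, although the page does not name the technique. Assuming that is what it does, the following sketch shows how a larger cutoff admits more low-probability candidates. `top_p_filter` and the example probabilities are hypothetical.

```python
def top_p_filter(probs, top_p):
    """Keep the most likely candidates until their mass reaches top_p.

    Assumption: standard nucleus filtering; not confirmed by the page.
    Returns a dict of candidate index -> renormalised probability.
    """
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break
    total = sum(probs[i] for i in kept)
    return {i: probs[i] / total for i in kept}


probs = [0.5, 0.3, 0.15, 0.05]  # made-up candidate probabilities
narrow = top_p_filter(probs, 0.5)
wide = top_p_filter(probs, 0.9)
print("top_p 0.5 keeps candidates:", sorted(narrow))
print("top_p 0.9 keeps candidates:", sorted(wide))
```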

number
(minimum: 0.5, maximum: 1.5)

Adjusts playback speed of the generated audio. Values below 1.0 slow the audio down, values above 1.0 speed it up, and 1.0 keeps the original speed.

Default: 0.94
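
The four numeric settings above each have a documented range and default, so a client can check values before submitting. The sketch below uses placeholder keys (`guidance`, `randomness`, `diversity`, `speed`) because the real field names are not shown on this page; only the limits and defaults are taken from it.

```python
# (minimum, maximum, default) as listed on the page.
# The keys are placeholders, not the model's real field names.
LIMITS = {
    "guidance": (1.0, 5.0, 3.0),
    "randomness": (0.1, 2.0, 1.3),
    "diversity": (0.1, 1.0, 0.95),
    "speed": (0.5, 1.5, 0.94),
}


def resolve_settings(**overrides):
    """Fill in defaults and reject values outside the documented ranges."""
    settings = {}
    for name, (low, high, default) in LIMITS.items():
        value = overrides.pop(name, default)
        if not low <= value <= high:
            raise ValueError(f"{name}={value} is outside [{low}, {high}]")
        settings[name] = value
    if overrides:
        raise ValueError(f"unknown settings: {sorted(overrides)}")
    return settings


settings = resolve_settings(speed=1.0)
print(settings)
```

Validating locally is a design choice rather than a requirement: it gives an immediate, readable error instead of waiting for the service to reject the request.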

integer

Random seed for reproducible results. Use the same seed value to get the same output for identical inputs. Leave blank for random results each time.
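
The seed behaviour can be illustrated with Python's standard `random` module. This is a generic demonstration of seeded pseudo-randomness, not the model's actual sampler, and `sample_tokens` is a hypothetical stand-in.

```python
import random


def sample_tokens(seed=None, count=5):
    """Draw pseudo-random integers from a generator with its own seed.

    With seed=None the generator is seeded from system entropy, which
    mirrors leaving the seed field blank.
    """
    rng = random.Random(seed)
    return [rng.randrange(1000) for _ in range(count)]


first = sample_tokens(seed=42)
second = sample_tokens(seed=42)
print(first == second)  # same seed, same inputs: identical draws
print(sample_tokens())  # no seed: values generally differ between runs
```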

Including audio_prompt and 2 more...
