aodianyun/f5-tts-thai

Public
3 runs

Run aodianyun/f5-tts-thai with an API

Use one of our client libraries to get started quickly. Clicking on a library will take you to the Playground tab where you can tweak different inputs, see the results, and copy the corresponding code to use in your own project.

Input schema

The fields you can use to run this model with an API. If you don't give a value for a field its default value will be used.

Field Type Default value Description
audio
string
参考音频文件(WAV/MP3)。当 no_ref_audio=True 时此参数将被忽略。
text
string
要生成的文本内容
ref_text
string
参考文本(可选;为空则自动识别参考音频)
remove_silence
boolean
True
是否移除生成音频中的静音段
speed
number
1

Min: 0.3

Max: 2

语速(0.3-2.0)
nfe_step
integer
32

Min: 4

Max: 64

NFE 步数(越大质量越好但越慢)
cross_fade_duration
number
0.15

Max: 1

分段拼接淡入淡出(秒)
cfg_strength
number
2

Max: 10

CFG 强度(控制与参考音频相似度)
sway_sampling_coef
number
-1

Min: -5

Max: 5

采样系数(建议 -1 或 2-5)
max_chars
integer
250

Min: 50

Max: 800

长文本分段最大字节数(越小越稳但越慢)
seed
integer
-1
随机种子(-1 表示随机)
no_ref_audio
boolean
False
是否不使用参考音频(纯文本生成)

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{
  "type": "string",
  "title": "Output",
  "format": "uri"
}