aodianyun/f5-tts-thai-v2:77a02a69 | Run with an API on Replicate


The fields you can use to run this model with an API. If you don’t give a value for a field, its default value will be used.

Field	Type	Default value	Description
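audio	string		Reference audio file (WAV/MP3). Ignored when no_ref_audio=True.
text	string		Text to generate speech for.
ref_text	string		Reference text (optional; if empty, the reference audio is transcribed automatically).
remove_silence	boolean	True	Whether to remove silent segments from the generated audio.
speed	number	1 (min 0.3, max 2)	Speech speed (0.3-2.0).
nfe_step	integer	32 (min 4, max 64)	Number of NFE steps (higher gives better quality but is slower).
cross_fade_duration	number	0.15 (max 1)	Cross-fade duration used when joining segments, in seconds.
cfg_strength	number	2 (max 10)	CFG strength (controls similarity to the reference audio).
sway_sampling_coef	number	-1 (min -5, max 5)	Sampling coefficient (recommended: -1 or 2-5).
max_chars	integer	250 (min 50, max 800)	Maximum number of bytes per segment when splitting long text (smaller is more stable but slower).
seed	integer	-1	Random seed (-1 means random).
no_ref_audio	boolean	False	Whether to skip the reference audio (text-only generation; currently not supported and will raise an error).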
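
The ranges in the table can be checked on the client side before a request is sent. The following is a minimal sketch, not part of the model or of any Replicate library: `NUMERIC_FIELDS` and `build_input` are hypothetical names, the bounds are copied from the table above, and treating `audio` and `text` as required is an assumption based on the table (no defaults are listed for them, and `no_ref_audio=True` is described as unsupported).

```python
# Defaults and bounds copied from the field table above.
# None means the table lists no Min or Max for that field.
NUMERIC_FIELDS = {
    # name: (type, default, minimum, maximum)
    "speed": (float, 1.0, 0.3, 2.0),
    "nfe_step": (int, 32, 4, 64),
    "cross_fade_duration": (float, 0.15, None, 1.0),
    "cfg_strength": (float, 2.0, None, 10.0),
    "sway_sampling_coef": (float, -1.0, -5.0, 5.0),
    "max_chars": (int, 250, 50, 800),
    "seed": (int, -1, None, None),
}


def build_input(audio, text, ref_text="", remove_silence=True, **overrides):
    """Build an input dict, checking numeric fields against the table's ranges."""
    # Assumption: audio is effectively required, since the table says
    # no_ref_audio=True is not supported yet.
    if not audio:
        raise ValueError("audio (reference audio) must be provided")
    # Assumption: an empty text is treated as an error here.
    if not text:
        raise ValueError("text must be non-empty")

    payload = {"audio": audio, "text": text, "remove_silence": bool(remove_silence)}
    if ref_text:
        # Left out when empty so the reference audio is transcribed automatically.
        payload["ref_text"] = ref_text

    for name, value in overrides.items():
        if name not in NUMERIC_FIELDS:
            raise KeyError(f"unknown field: {name}")
        kind, _default, lo, hi = NUMERIC_FIELDS[name]
        if kind is int:
            if isinstance(value, bool) or not isinstance(value, int):
                raise TypeError(f"{name} must be an integer")
        else:
            value = float(value)
        if lo is not None and value < lo:
            raise ValueError(f"{name}={value} is below the minimum {lo}")
        if hi is not None and value > hi:
            raise ValueError(f"{name}={value} is above the maximum {hi}")
        payload[name] = value
    return payload


# The URL below is a placeholder, not a real reference recording.
payload = build_input("https://example.com/ref.wav", "สวัสดีครับ", speed=1.2, nfe_step=16)
```

With the official `replicate` Python client, a dict like this would typically be passed as `replicate.run(ref, input=payload)`, where `ref` is the model name followed by a colon and the full version ID. Only a shortened version ID appears in the page title, so the full value has to be taken from the model page.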

The shape of the response you’ll get when you run this model with an API.

Schema

{'format': 'uri', 'title': 'Output', 'type': 'string'}
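
The schema describes a single string holding a URI, presumably pointing at the generated audio file. A minimal sketch of checking a response against that shape and choosing a local file name follows. `is_valid_output` and `output_filename` are hypothetical helpers, and the `output.wav` fallback is an assumption, since the schema does not state the audio format.

```python
import posixpath
from urllib.parse import urlparse

# The output schema as shown above.
OUTPUT_SCHEMA = {"format": "uri", "title": "Output", "type": "string"}


def is_valid_output(value):
    """Return True if value is a string that parses as a URI with a scheme."""
    if not isinstance(value, str):
        return False
    parts = urlparse(value)
    return bool(parts.scheme) and bool(parts.netloc or parts.path)


def output_filename(uri, fallback="output.wav"):
    """Take the last path segment of the URI as a file name, or use the fallback."""
    name = posixpath.basename(urlparse(uri).path)
    return name or fallback


# Placeholder URI for illustration only.
example = "https://example.com/files/out.wav"
if is_valid_output(example):
    print(output_filename(example))
```

Depending on the client library and its version, the result may be wrapped in a file-like object instead of being returned as a bare string; this sketch handles only the plain string case described by the schema.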