You're looking at a specific version of this model.

bzikst/higgs-audio-v3-tts-4b:d6c19f0e

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field, its default value will be used.

| Field | Type | Default value | Limits | Description |
|---|---|---|---|---|
| text | string | Hello, this is Higgs Audio v3 TTS. | | Text to synthesize. |
| reference_audio | string | | | Optional reference audio for voice cloning (WAV/MP3). |
| reference_text | string | | | Transcript of the reference audio (improves cloning quality). |
| voice | string | default | | Preset voice name (ignored if reference_audio is provided). |
| response_format | None | wav | | Output audio format. |
| temperature | number | 0.8 | Max: 2 | Sampling temperature. |
| top_p | number | 0.8 | Max: 1 | Nucleus sampling parameter. |
| max_new_tokens | integer | 1024 | Min: 1, Max: 4096 | Maximum number of tokens to generate. |
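
As an illustration of how the defaults and limits in the input schema above fit together, here is a minimal sketch that builds an input payload and checks it before sending. The names `DEFAULTS`, `OPTIONAL_FIELDS`, and `build_input` are hypothetical helpers, not part of any published client. Only the limits listed in the schema are enforced.

```python
from typing import Any, Dict

# Defaults copied from the input schema; fields without a default are optional.
DEFAULTS: Dict[str, Any] = {
    "text": "Hello, this is Higgs Audio v3 TTS.",
    "voice": "default",
    "response_format": "wav",
    "temperature": 0.8,
    "top_p": 0.8,
    "max_new_tokens": 1024,
}
OPTIONAL_FIELDS = {"reference_audio", "reference_text"}


def build_input(**overrides: Any) -> Dict[str, Any]:
    """Merge caller overrides onto the documented defaults and check limits."""
    unknown = set(overrides) - set(DEFAULTS) - OPTIONAL_FIELDS
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")
    payload = {**DEFAULTS, **overrides}
    # Only the limits stated in the schema are checked here.
    if payload["temperature"] > 2:
        raise ValueError("temperature must be at most 2")
    if payload["top_p"] > 1:
        raise ValueError("top_p must be at most 1")
    if not 1 <= payload["max_new_tokens"] <= 4096:
        raise ValueError("max_new_tokens must be between 1 and 4096")
    return payload


payload = build_input(text="Good morning.", temperature=0.7)
print(payload)
```

The resulting dictionary could then be passed as the `input` argument of an API client, for example `replicate.run(...)` from the `replicate` Python package, assuming the model is served that way. The call needs the full version identifier, and this page shows only a shortened one.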

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}
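
Since the output is a single string in URI format, a caller would typically validate it and pick a local file name before downloading. The sketch below assumes an http(s)-style URI with a host and a path. The function `output_filename` and the example URI are hypothetical and only illustrate the shape of the response.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse


def output_filename(output: str, fallback: str = "output.wav") -> str:
    """Check that `output` is an absolute URI and derive a local file name."""
    if not isinstance(output, str):
        raise TypeError("output must be a string")
    parsed = urlparse(output)
    if not parsed.scheme or not parsed.netloc:
        raise ValueError(f"not an absolute URI: {output!r}")
    # Use the last path segment as the file name, if there is one.
    name = PurePosixPath(parsed.path).name
    return name or fallback


# Hypothetical URI, used only to illustrate the shape of the response.
example = "https://example.com/files/abc123/output.wav"
print(output_filename(example))
```

With a real response, the file could then be fetched with the standard library, for instance `urllib.request.urlretrieve(output, output_filename(output))`.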