ID
by67sg9dxdrm80cpjat9x3apxw
Status
Succeeded
Source
Web
Total duration
Created
Webhook

Input

text
Speech-02-series is a Text-to-Audio and voice cloning technology that offers voice synthesis, emotional expression, and multilingual capabilities. The HD version is optimized for high-fidelity applications like voiceovers and audiobooks. While the turbo one is designed for real-time applications with low latency. When using this model on Replicate, each character represents 1 token.
voice_id
Deep_Voice_Man
speed
1
volume
1
pitch
0
emotion
angry
english_normalization
true
sample_rate
32000
bitrate
128000
channel
mono
language_boost
English

Output

Generated in
Input tokens
380
Output tokens
1
Tokens per second
0.42 tokens / second
Time to first token