cuuupid / seamless_expressive

Translate audio while keeping the original style, pronunciation and tone of your original audio.

  • Public
  • 772 runs
  • L40S
  • GitHub
  • Paper
  • License

Input

Video Player is loading.
Current Time 00:00:000
Duration 00:00:000
Loaded: 0%
Stream Type LIVE
Remaining Time 00:00:000
 
1x
file

Provide your input audio in your original language.

string

Provide the original language of the input audio.

Default: "English"

string

Provide the target language for your output audio.

Default: "French"

number

Recommended: 1.0 for English, Mandarin, Spanish; 1.1 for German; 1.2 for French.

Default: 1

Output

text_out

Por favor, mantén el volumen bajo. Acabamos de dormir al bebé.

audio_out

Video Player is loading.
Current Time 00:00:000
Duration 00:00:000
Loaded: 0%
Stream Type LIVE
Remaining Time 00:00:000
 
1x
Generated in

Run time and cost

This model costs approximately $0.044 to run on Replicate, or 22 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 45 seconds. The predict time for this model varies significantly based on the inputs.