suminhthanh / xtts-v2-custom

  • Public
  • 14 runs

Run suminhthanh/xtts-v2-custom with an API

Use one of our client libraries to get started quickly.

Input schema

The fields you can use to run this model with an API. If you don't give a value for a field, its default value is used.

| Field | Type | Default value | Description |
| --- | --- | --- | --- |
| text | string | Xin chào các bạn | Text to synthesize |
| speaker | string | | Original speaker audio (wav, mp3, m4a, ogg, or flv). Duration should be at least 6 seconds. |
| language | string (enum) | vi | Output language for the synthesised speech. Options: vi, en, es, fr, de, it, pt, pl, tr, ru, nl, cs, ar, zh, hu, ko, hi |
| cleanup_voice | boolean | True | Whether to apply denoising to the speaker audio (microphone recordings) |
| use_deepfilter | boolean | True | Whether to use deepfilter |
| normalize_text | boolean | True | Whether to normalize the text |
| aws_access_key_id | string | | AWS ACCESS KEY ID |
| aws_secret_access_key | string | | AWS SECRET ACCESS KEY |
| bucket_name | string | | AWS S3 Bucket Name |
| cdn_download_url | string | | CDN Download URL |
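
As a rough illustration of how these fields fit together, the sketch below builds an input payload using the defaults from the schema and checks the language code against the listed options. The helper names `build_input` and `run_model` are hypothetical, and treating `speaker` as a URL string is an assumption (the schema only says the field is a string). The call to `replicate.run` uses the Replicate Python client; the exact model reference, which may need a version identifier, is left as a parameter rather than guessed.

```python
from typing import Any, Dict

# Language codes listed under "Options" in the input schema.
SUPPORTED_LANGUAGES = {
    "vi", "en", "es", "fr", "de", "it", "pt", "pl", "tr",
    "ru", "nl", "cs", "ar", "zh", "hu", "ko", "hi",
}

# Optional string fields from the schema that have no default value.
OPTIONAL_FIELDS = {
    "aws_access_key_id", "aws_secret_access_key",
    "bucket_name", "cdn_download_url",
}


def build_input(speaker: str, text: str = "Xin chào các bạn",
                language: str = "vi", cleanup_voice: bool = True,
                use_deepfilter: bool = True, normalize_text: bool = True,
                **optional: str) -> Dict[str, Any]:
    """Hypothetical helper: assemble the input dict with schema defaults."""
    if language not in SUPPORTED_LANGUAGES:
        raise ValueError(f"unsupported language: {language!r}")
    unknown = set(optional) - OPTIONAL_FIELDS
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")
    payload: Dict[str, Any] = {
        "text": text,
        "speaker": speaker,  # assumed to be a URL to the reference audio
        "language": language,
        "cleanup_voice": cleanup_voice,
        "use_deepfilter": use_deepfilter,
        "normalize_text": normalize_text,
    }
    payload.update(optional)
    return payload


def run_model(model_ref: str, payload: Dict[str, Any]) -> Any:
    """Hypothetical wrapper around the Replicate client call."""
    import replicate  # third-party; reads REPLICATE_API_TOKEN from the environment
    return replicate.run(model_ref, input=payload)
```

Keeping payload construction separate from the network call means the defaults and the enum check can be verified without an API token.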

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{
  "type": "object",
  "title": "Output",
  "properties": {
    "path": {
      "title": "Path"
    }
  }
}
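
The schema above declares a single `path` property without a type, so a caller may want to handle it defensively. The following is a minimal sketch, assuming the response arrives as a mapping with a `path` key; the helper name `extract_path` is hypothetical, and converting the value with `str()` is an assumption about how the path would be consumed.

```python
from typing import Any, Mapping


def extract_path(output: Any) -> str:
    """Hypothetical helper: pull the `path` value out of a model response."""
    if not isinstance(output, Mapping) or "path" not in output:
        raise ValueError("response does not match the Output schema")
    # The schema gives no type for `path`, so normalise it to a string.
    return str(output["path"])
```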