thlz998 / chat-tts

This is an implementation of ChatTTS as a Cog model.

  • Public
  • 3.1K runs
  • L40S
  • GitHub
  • Paper
  • License
  • Prediction

    thlz998/chat-tts:864cbf22ea816a82f3da143049cd5d4c94b95b58567c98536165a44c74b540d1
    ID
    g2ks2d56edrgg0cfv0kbm9t9m8
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Created by @thlz998

    Input

    text
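    chat T T S 是一款强大的对话式文本转语音模型。它有中英混读和多说话人的能力。 chat T T S 不仅能够生成自然流畅的语音,还能控制[laugh]笑声啊[laugh], 停顿啊[uv_break]语气词啊等副语言现象[uv_break]。这个韵律超越了许多开源模型[uv_break]。 请注意,chat T T S 的使用应遵守法律和伦理准则,避免滥用的安全风险。[uv_break]
    (English translation: chat T T S is a powerful conversational text-to-speech model. It supports mixed Chinese-English reading and multiple speakers. chat T T S not only generates natural, fluent speech, it can also control paralinguistic phenomena such as [laugh]laughter[laugh], pauses[uv_break], and filler words[uv_break]. Its prosody surpasses that of many open-source models[uv_break]. Please note that chat T T S should be used in accordance with legal and ethical guidelines, avoiding the safety risks of misuse.[uv_break])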
    top_k
    20
    top_p
    0.7
    voice
    2222
    prompt
     
    skip_refine
    0
    temperature
    0.3
    custom_voice
    0

    Output

    { "audio_files": [ { "filename": "https://storage.googleapis.com/replicate-files/xzMIhsaUUf1mRiSOpfXefBqRn5m689wtCaKiPeePnJuI0lnuE/20240602-08_53_33-113d70262afdac3689889619d465b2cd.wav", "audio_duration": 29.46, "inference_time": 195.29 } ] }
  • Prediction

    thlz998/chat-tts:864cbf22ea816a82f3da143049cd5d4c94b95b58567c98536165a44c74b540d1
    ID
    pz5wsbkr5drgc0cfv2f9pkfrv4
    Status
    Succeeded
    Source
    Web
    Hardware
    A100 (40GB)

    Input

    text
    chat T T S is a text to speech model designed for dialogue applications. [uv_break]it supports mixed language input [uv_break]and offers multi speaker capabilities with precise control over prosodic elements [laugh]like like [uv_break]laughter[laugh], [uv_break]pauses, [uv_break]and intonation. [uv_break]it delivers natural and expressive speech,[uv_break]so please [uv_break] use the project responsibly at your own risk.[uv_break]
    top_k
    20
    top_p
    0.7
    voice
    2222
    prompt
     
    skip_refine
    0
    temperature
    0.3
    custom_voice
    0

    Output

    { "audio_files": [ { "filename": "https://storage.googleapis.com/r8-outputs-us-central1-long-term/zE2cHcpQeGXEPSKjCwJlczG9VZaiAKUgdP65E03cofflmA1lA/20240602-11_05_04-e69e86d99788cc217ec9ab0b3e156bc7.wav", "audio_duration": 30.31, "inference_time": 241.26 } ] }
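
The output above is a JSON object with an `audio_files` list, where each entry carries a `filename` URL, an `audio_duration`, and an `inference_time`. A minimal sketch of reading it follows. It assumes both numeric fields use the same unit (the page does not state one, presumably seconds), and the helper name `summarize_output` is hypothetical, chosen for illustration only.

```python
import json

# Output of the second prediction, copied verbatim from the page.
raw_output = (
    '{ "audio_files": [ { "filename": '
    '"https://storage.googleapis.com/r8-outputs-us-central1-long-term/'
    'zE2cHcpQeGXEPSKjCwJlczG9VZaiAKUgdP65E03cofflmA1lA/'
    '20240602-11_05_04-e69e86d99788cc217ec9ab0b3e156bc7.wav", '
    '"audio_duration": 30.31, "inference_time": 241.26 } ] }'
)


def summarize_output(raw):
    """Return (url, audio_duration, inference_time, ratio) per audio file.

    ratio = inference_time / audio_duration, meaningful only under the
    assumption that both fields share one unit.
    """
    data = json.loads(raw)
    rows = []
    for item in data["audio_files"]:
        ratio = item["inference_time"] / item["audio_duration"]
        rows.append(
            (item["filename"], item["audio_duration"], item["inference_time"], ratio)
        )
    return rows


for url, audio, infer, ratio in summarize_output(raw_output):
    print(url)
    print(f"  audio_duration={audio}, inference_time={infer}, ratio={ratio:.2f}")
```

Under that unit assumption, a ratio above 1 would mean the reported inference time was longer than the audio it produced.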
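
The input fields listed in the predictions above can be gathered into a single request. The sketch below is one way to do that with the Replicate Python client's `replicate.run`; it is not taken from the model author. The defaults mirror the values shown on this page, but the field types (integers for `voice`, `top_k`, `skip_refine`, and `custom_voice`; an empty string for `prompt`) are assumptions, since the page shows only the rendered values. The helper names `build_input` and `run_prediction` are hypothetical.

```python
# Model identifier exactly as shown on the page.
MODEL = (
    "thlz998/chat-tts:"
    "864cbf22ea816a82f3da143049cd5d4c94b95b58567c98536165a44c74b540d1"
)


def build_input(text, voice=2222, temperature=0.3, top_p=0.7, top_k=20,
                prompt="", skip_refine=0, custom_voice=0):
    """Collect the input fields shown on the page into one dict.

    Types are assumed, not confirmed by the page. Tokens such as
    [laugh] and [uv_break] are passed through inside `text` unchanged.
    """
    return {
        "text": text,
        "voice": voice,
        "temperature": temperature,
        "top_p": top_p,
        "top_k": top_k,
        "prompt": prompt,
        "skip_refine": skip_refine,
        "custom_voice": custom_voice,
    }


def run_prediction(inputs):
    """Send the inputs to Replicate.

    Requires `pip install replicate` and a REPLICATE_API_TOKEN
    environment variable; the import is deferred so the rest of the
    sketch works without either.
    """
    import replicate
    return replicate.run(MODEL, input=inputs)


if __name__ == "__main__":
    inputs = build_input(
        "chat T T S is a text to speech model designed for "
        "dialogue applications. [uv_break]"
    )
    print(inputs)
    # Uncomment to make a real (billed) API call:
    # print(run_prediction(inputs))
```

Keeping payload construction separate from the network call lets the parameters be inspected or logged before any request is sent.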
