adirik / styletts2

Generates speech from text

  • Public
  • 131K runs
  • T4
  • GitHub
  • Paper
  • License
Iterate in playground
Run with an API
  • Prediction

    adirik/styletts2:dd4d03b097968361dda9b0563716eb0758d1d5b8aeb890d22bd08634e2bd069c
    ID
    2oakv4bbn7ttwv6bb6jbimzqsq
    Status
    Succeeded
    Source
    Web
    Hardware
    T4
    Total duration
    Created

    Input

    beta
    0.7
    seed
    0
    text
    If the supply of fruit is greater than the family needs, it may be made a source of income by sending the fresh fruit to the market if there is one near enough, or by preserving, canning, and making jelly for sale. To make such an enterprise a success the fruit and work must be first class. There is magic in the word 'Homemade,' when the product appeals to the eye and the palate; but many careless and incompetent people have found to their sorrow that this word has not magic enough to float inferior goods on the market. As a rule large canning and preserving establishments are clean and have the best appliances, and they employ chemists and skilled labor. The home product must be very good to compete with the attractive goods that are sent out from such establishments. Yet for first-class homemade products there is a market in all large cities. All first-class grocers have customers who purchase such goods.
    alpha
    0.3
    reference
    Video Player is loading.
    Current Time 00:00:000
    Duration 00:00:000
    Loaded: 0%
    Stream Type LIVE
    Remaining Time 00:00:000
     
    1x
    diffusion_steps
    10
    embedding_scale
    1.5

    Output

    Video Player is loading.
    Current Time 00:00:000
    Duration 00:00:000
    Loaded: 0%
    Stream Type LIVE
    Remaining Time 00:00:000
     
    1x
    Generated in
  • Prediction

    adirik/styletts2:53fd5081feae9440974d1ef9cae83bf7af5fe18be1646343f37e559f5f80a613
    ID
    ehv426rccpf2wzjpe4yon6oahq
    Status
    Succeeded
    Source
    Web
    Hardware
    T4
    Total duration
    Created

    Input

    beta
    0.7
    seed
    0
    text
    StyleTTS 2 is a text-to-speech model that leverages style diffusion and adversarial training with large speech language models to achieve human-level text-to-speech synthesis.
    alpha
    0.3
    diffusion_steps
    10
    embedding_scale
    1.5

    Output

    Video Player is loading.
    Current Time 00:00:000
    Duration 00:00:000
    Loaded: 0%
    Stream Type LIVE
    Remaining Time 00:00:000
     
    1x
    Generated in

Want to make some of these yourself?

Run this model