haoheliu / audio-ldm

Text-to-audio generation with latent diffusion models

  • Public
  • 37.4K runs
  • T4
  • GitHub
  • Paper
  • License
  • Prediction

    haoheliu/audio-ldm:b61392ad
    ID
    md4wuqmzxjahzogiwq5q5al6tm
    Status
    Succeeded
    Source
    Web
    Hardware
    Total duration
    Created

    Input

    text
    catchy upbeat pop music, kick drum, bouncy
    duration
    5.0
    n_candidates
    3
    guidance_scale
    2.5

    Output

    Video Player is loading.
    Current Time 00:00:000
    Duration 00:00:000
    Loaded: 0%
    Stream Type LIVE
    Remaining Time 00:00:000
     
    1x
    Generated in
  • Prediction

    haoheliu/audio-ldm:b61392ad
    ID
    3zgl32rcf5ahris63opyvlssou
    Status
    Succeeded
    Source
    Web
    Hardware
    Total duration
    Created

    Input

    text
    a hammer hits a wooden surface
    duration
    5.0
    n_candidates
    3
    guidance_scale
    2.5

    Output

    Video Player is loading.
    Current Time 00:00:000
    Duration 00:00:000
    Loaded: 0%
    Stream Type LIVE
    Remaining Time 00:00:000
     
    1x
    Generated in
  • Prediction

    haoheliu/audio-ldm:b61392ad
    ID
    c24g2srwuvd23ascxoezjlqxfq
    Status
    Succeeded
    Source
    Web
    Hardware
    Total duration
    Created
    by @jagilley

    Input

    text
    two starships are fighting in space with laser cannons
    duration
    5.0
    n_candidates
    3
    guidance_scale
    2.5

    Output

    Video Player is loading.
    Current Time 00:00:000
    Duration 00:00:000
    Loaded: 0%
    Stream Type LIVE
    Remaining Time 00:00:000
     
    1x
    Generated in

Want to make some of these yourself?

Run this model