acappemin / video-to-audio-and-piano

Enhance Generation Quality of Flow Matching V2A Model via Multi-Step CoT-Like Guidance and Combined Preference Optimization

  • Public
  • 45 runs
  • GitHub
  • Weights
  • Paper
  • License
Iterate in playground
  • Prediction

    acappemin/video-to-audio-and-piano:d08087903b561981d8fe41af352a027e0e50b725e2a4dc8bd7b233f23dc2bdf1
    ID
    4knj559zd9rma0cpeqy88f83jw
    Status
    Succeeded
    Source
    Web
    Hardware
    L40S
    Total duration
    Created
    by @acappemin

    Input

    video
    prompt
    the sound of playing piano
    if_piano
    v2a_num_steps
    25

    Output

    Generated in
  • Prediction

    acappemin/video-to-audio-and-piano:d08087903b561981d8fe41af352a027e0e50b725e2a4dc8bd7b233f23dc2bdf1
    ID
    cw7p1f6tx5rma0cpeqz99rh7q8
    Status
    Succeeded
    Source
    Web
    Hardware
    L40S
    Total duration
    Created

    Input

    video
    prompt
    the sound of playing piano
    if_piano
    v2a_num_steps
    25

    Output

    Generated in
  • Prediction

    acappemin/video-to-audio-and-piano:d08087903b561981d8fe41af352a027e0e50b725e2a4dc8bd7b233f23dc2bdf1
    ID
    haxymbrchsrme0cper0vtrd73r
    Status
    Succeeded
    Source
    Web
    Hardware
    L40S
    Total duration
    Created

    Input

    video
    prompt
    the sound of ripping paper
    if_piano
    v2a_num_steps
    25

    Output

    Generated in
  • Prediction

    acappemin/video-to-audio-and-piano:d08087903b561981d8fe41af352a027e0e50b725e2a4dc8bd7b233f23dc2bdf1
    ID
    nfhqsz27dhrm80cper19z11x0m
    Status
    Succeeded
    Source
    Web
    Hardware
    L40S
    Total duration
    Created

    Input

    video
    prompt
    the sound of race car, auto racing
    if_piano
    v2a_num_steps
    25

    Output

    Generated in

Want to make some of these yourself?

Run this model