nateraw / audio-super-resolution

AudioSR: Versatile Audio Super-resolution at Scale

  • Public
  • 56.1K runs
  • L40S
  • GitHub
  • Paper
  • License

Input

Video Player is loading.
Current Time 00:00:000
Duration 00:00:000
Loaded: 0%
Stream Type LIVE
Remaining Time 00:00:000
 
1x
*file

Audio to upsample

integer
(minimum: 10, maximum: 500)

Number of inference steps

Default: 50

number
(minimum: 1, maximum: 20)

Scale for classifier free guidance

Default: 3.5

integer

Random seed. Leave blank to randomize the seed

Output

Video Player is loading.
Current Time 00:00:000
Duration 00:00:000
Loaded: 0%
Stream Type LIVE
Remaining Time 00:00:000
 
1x
Generated in

This example was created by a different version, nateraw/audio-super-resolution:9c3d3e39.

Run time and cost

This model costs approximately $0.030 to run on Replicate, or 33 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 31 seconds. The predict time for this model varies significantly based on the inputs.

Readme

This model doesn't have a readme.