sakemin / audiosr-long-audio

Versatile Audio Super-resolution at Scale which upsamples audio files to 48khz. Longer audio input is possible with this model

  • Public
  • 1.8K runs
  • GitHub
  • Paper

Input

Output

Run time and cost

This model costs approximately $0.19 to run on Replicate, or 5 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia A40 (Large) GPU hardware. Predictions typically complete within 5 minutes. The predict time for this model varies significantly based on the inputs.

Readme

AudioSR: Versatile Audio Super-resolution at Scale

Pass your audio in, AudioSR will make it high fidelity!

Work on all types of audio (e.g., music, speech, dog, raining, …) & to 48khz.

Longer audio input is also compatible with this model, by implementing audio slicing into chunks.