xavriley / sax_transcription

Transcribe saxophone solos directly from audio

  • Public
  • 148 runs
  • GitHub

Input

Output

Run time and cost

This model runs on Nvidia T4 GPU hardware. We don't yet have enough runs of this model to provide performance information.

Readme

This model accompanies the paper “Reconstructing the Charlie Parker Omnibook using an audio-to-score automatic transcription pipeline” (currently under review).

The model takes either

  • an audio file

or

  • a YouTube url

You can optionally specify a start time and finish time for use with YouTube videos.

The model extracts the saxophone audio, transcribes it to MIDI and then converts the MIDI to sheet music. It returns a MusicXML file which you can import into any sheet music program. It also returns a json file containing syncpoints for use with Soundslice.

More details to follow!