Spleeter is Deezer source separation library with pretrained models written in Python and uses Tensorflow.
Convert speech in audio to text