This demo runs a series of transfer learning classifiers trained on different audio embeddings.
These classifiers were trained on a mixture of public and in-house MTG datasets.
The embeddings come from the following models:
- MusiCNN. A musically motivated CNN with two variants, one trained on the Million Song Dataset and one on MagnaTagATune.
- VGGish. A large VGG variant trained on a preliminary version of the AudioSet dataset.
Transfer learning classifiers
Our models consist of single-hidden-layer MLPs trained on the considered embeddings.
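As a rough illustration of what a single-hidden-layer MLP on top of precomputed embeddings looks like, here is a minimal NumPy sketch. It is not the Essentia implementation: the layer sizes, the ReLU and softmax activations, the random weights, and the averaging of per-frame predictions are all assumptions made for the example, and the helper names `init_mlp` and `predict` are hypothetical.

```python
import numpy as np


def init_mlp(embedding_dim, hidden_dim, num_classes, seed=0):
    """Create randomly initialised weights for a one-hidden-layer MLP."""
    rng = np.random.default_rng(seed)
    return {
        "W1": rng.normal(0.0, 0.01, (embedding_dim, hidden_dim)),
        "b1": np.zeros(hidden_dim),
        "W2": rng.normal(0.0, 0.01, (hidden_dim, num_classes)),
        "b2": np.zeros(num_classes),
    }


def predict(params, embeddings):
    """Map a (frames, embedding_dim) array to per-frame class probabilities."""
    # Hidden layer with ReLU (the activation is an assumption).
    hidden = np.maximum(0.0, embeddings @ params["W1"] + params["b1"])
    logits = hidden @ params["W2"] + params["b2"]
    # Numerically stable softmax over the class axis.
    logits = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(logits)
    return exp / exp.sum(axis=1, keepdims=True)


# Placeholder sizes: 128-dimensional embeddings, 64 hidden units, 2 classes.
params = init_mlp(embedding_dim=128, hidden_dim=64, num_classes=2)

# Stand-in for embeddings extracted from 10 audio frames of one track.
frames = np.random.default_rng(1).normal(size=(10, 128))

probs = predict(params, frames)          # shape (10, 2)
track_prediction = probs.mean(axis=0)    # one averaged prediction per track
```

In a real setup the embedding model stays frozen and only the MLP weights are trained on the labelled target dataset, which is what makes this a transfer learning approach.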
These models are part of Essentia Models, developed by MTG-UPF, and are publicly available under CC BY-NC-SA and commercial licenses.