"MusiConGen: Rhythm and chord control for Transformer-based text-to-music generation"
Cog implementation of the 'All-In-One Music Structure Analyzer' by mir-aidj (Taejun Kim)
Versatile Audio Super-resolution at Scale, which upsamples audio files to 48 kHz. Longer audio input is possible with this model.
MusicGen stereo-medium model fine-tuned on Bbongjjak (뽕짝), a modern Korean electronic pop genre, with the text token 'Bbongjjak'.
Generate music restricted to chord sequences and tempo
Fine-tune MusicGen small, medium, and melody models. Stereo models are also available.
MusicGen trained on NewJeans
MusicGen trained on NewJeans with vocals
MusicGen Stereo model fine-tuned on the tracks of Oneohtrix Point Never with the text token "OPN"
MusicGen stereo fine-tuned on Pansori Epic Chant, a Korean folk music genre, with the text token “Korean traditional folk music, pansori”
Remix music into other styles with MusicGen Chord
MusicGen Stereo Medium fine-tuned on ambient with the text token "breathe"
Generate music in stereo, restricted to chord sequences and tempo
MusicGen Stereo Medium fine-tuned on industrial techno with the text token "construction hymn"
MusicGen fine-tuned on cover songs by Toad from the 'Super Mario' series, with the text token "by toad"
PyTSMod is an open-source library for Time-Scale Modification (e.g., time-stretching) algorithms, by Sangeon Yong at MAC Lab, KAIST.
SDXL model fine-tuned on Shroomie the dog, with the text token 'SHRMI'.