Readme
MusiConGen
Yun-Han Lan, Wen-Yi Hsiao, Hao-Chung Cheng and Yi-Hsuan Yang “MusiConGen: Rhythm and chord control for Transformer-based text-to-music generation” In Proc. Int. Society for Music Information Retrieval Conf. (ISMIR), 2024.
MusiConGen is based on pretrained MusicGen with additional controls: Rhythm and Chords. The project contains inference, training code and training data (youtube list).
Arxiv Paper | Demo
Text Based Chord Conditioning
Text Chord Condition Format
SPACE
is used as split token. Each splitted chunk is assigned to a single bar.C G E:min A:min
- When multiple chords must be assigned in a single bar, then append more chords with
,
.C G,G:7 E:min,E:min7 A:min
- Chord type can be specified after
:
.- Just using a single uppercase alphabet(eg.
C
,E
) is considered as a major chord. maj
,min
,dim
,aug
,min6
,maj6
,min7
,minmaj7
,maj7
,7
,dim7
,hdim7
,sus2
andsus4
can be appended with:
.- eg.
E:dim
,B:sus2
- eg.
- Just using a single uppercase alphabet(eg.
- ‘sharp’ and ‘flat’ can be specified with
#
andb
.- eg.
E#:min
Db
- eg.
BPM and Time Signature
- To create chord chroma,
bpm
andtime_sig
values must be specified.bpm
can be a float value. (eg.132
,60
)- The format of
time_sig
is(int)/(int)
. (eg.4/4
,3/4
,6/8
,7/8
,5/4
)
bpm
andtime_sig
values will be automatically concatenated afterprompt
description value, so you don’t need to specify bpm or time signature information in the description forprompt
.
Licenses
- All code in this repository is licensed under the Apache License 2.0 license.
- The code in the MusiConGen repository is licensed under the Apache License 2.0 license.
- The code in the Audiocraft repository is released under the MIT license as found in the LICENSE file.
- The weights in the Audiocraft repository are released under the CC-BY-NC 4.0 license as found in the LICENSE_weights file.
Citation
@misc{lan2024musicongenrhythmchordcontrol,
title={MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation},
author={Yun-Han Lan and Wen-Yi Hsiao and Hao-Chung Cheng and Yi-Hsuan Yang},
year={2024},
eprint={2407.15060},
archivePrefix={arXiv},
primaryClass={cs.SD},
}