MusicGen Remixer is an app based on MusicGen Chord. Users can upload a music track with vocals, type in the text description prompt, and the app will create a new background track based on the input and then make a remixed music output.
model_version: Model type. Computations take longer when using
prompt: A description of the music you want to generate.
music_input: An audio file input for the remix.
True, the EnCodec tokens will be decoded with MultiBand Diffusion. Not compatible with
normalization_strategy: Strategy for normalizing audio.
beat_sync_threshold: When beat syncing, if the gap between generated downbeat timing and input audio downbeat timing is larger than
beat_sync_threshold, consider the beats are not corresponding.
chroma_coefficient: Coefficient value multiplied to multi-hot chord chroma.
top_k: Reduces sampling to the k most likely tokens.
top_p: Reduces sampling to tokens with cumulative probability of p. When set to
0(default), top_k sampling is used.
temperature: Controls the ‘conservativeness’ of the sampling process. Higher temperature means more diversity.
classifier_free_guidance: Increases the influence of inputs on the output. Higher values produce lower-varience outputs that adhere more closely to inputs.
output_format: str = Output format for generated audio. “wav”, “mp3”
seed: Seed for random number generator. If
-1, a random seed will be used.
If the input music has a phase with no chords, (i.e. long intro part, or only vocal break) the model will get confused, and the output quality might go bad.
- Chord recognition from audio file is performed using BTC model, by Jonggwon Park.
- Vocal dropping is implemented using Meta’s
- Downbeat tracking and BPM retrieval is perfromed using All-In-One Music Structure Analyzer by Taejun Kim.
- Beat-syncing is performed with PyTSMod by MAC Lab @KAIST
- All code in this repository is licensed under the Apache License 2.0 license.
- The weights in this repository repository are released under the CC-BY-NC 4.0 license as found in the LICENSE_weights file.
- The code in the Audiocraft repository is released under the MIT license (see LICENSE file).
- The weights in the Audiocraft repository are released under the CC-BY-NC 4.0 license (see LICENSE_weights file).