You're looking at a specific version of this model. Jump to the model overview.
andreasjansson /musicgen-choral-chord:dee87578
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.
| Field | Type | Default value | Description |
|---|---|---|---|
| prompt |
string
|
A description of the music you want to generate.
|
|
| text_chords |
string
|
A text based chord progression condition. Single uppercase alphabet character(eg. `C`) is considered as a major chord. Chord attributes like(`maj`, `min`, `dim`, `aug`, `min6`, `maj6`, `min7`, `minmaj7`, `maj7`, `7`, `dim7`, `hdim7`, `sus2` and `sus4`) can be added to the root alphabet character after `:`.(eg. `A:min7`) Each chord token splitted by `SPACE` is allocated to a single bar. If more than one chord must be allocated to a single bar, cluster the chords adding with `,` without any `SPACE`.(eg. `C,C:7 G, E:min A:min`) You must choose either only one of `audio_chords` below or `text_chords`.
|
|
| bpm |
number
|
BPM condition for the generated output. `text_chords` will be processed based on this value. This will be appended at the end of `prompt`.
|
|
| time_sig |
string
|
Time signature value for the generate output. `text_chords` will be processed based on this value. This will be appended at the end of `prompt`.
|
|
| audio_chords |
string
|
An audio file that will condition the chord progression. You must choose only one among `audio_chords` or `text_chords` above.
|
|
| audio_start |
integer
|
0
|
Start time of the audio file to use for chord conditioning.
|
| audio_end |
integer
|
End time of the audio file to use for chord conditioning. If None, will default to the end of the audio clip.
|
|
| duration |
integer
|
8
|
Duration of the generated audio in seconds.
|
| continuation |
boolean
|
False
|
If `True`, generated music will continue from `audio_chords`. If chord conditioning, this is only possible when the chord condition is given with `text_chords`. If `False`, generated music will mimic `audio_chords`'s chord.
|
| multi_band_diffusion |
boolean
|
False
|
If `True`, the EnCodec tokens will be decoded with MultiBand Diffusion.
|
| normalization_strategy |
None
|
loudness
|
Strategy for normalizing audio.
|
| chroma_coefficient |
number
|
1
Min: 0.5 Max: 2.5 |
Coefficient value multiplied to multi-hot chord chroma.
|
| top_k |
integer
|
250
|
Reduces sampling to the k most likely tokens.
|
| top_p |
number
|
0
|
Reduces sampling to tokens with cumulative probability of p. When set to `0` (default), top_k sampling is used.
|
| temperature |
number
|
1
|
Controls the 'conservativeness' of the sampling process. Higher temperature means more diversity.
|
| classifier_free_guidance |
integer
|
3
|
Increases the influence of inputs on the output. Higher values produce lower-varience outputs that adhere more closely to inputs.
|
| output_format |
None
|
wav
|
Output format for generated audio.
|
| seed |
integer
|
Seed for random number generator. If `None` or `-1`, a random seed will be used.
|
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}