sakemin/musicongen:1e27616c | Run with an API on Replicate

You're looking at a specific version of this model. Jump to the model overview.

sakemin /musicongen:1e27616c

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field	Type	Default value	Description
prompt	string	A laid-back blues shuffle with a relaxed tempo, warm guitar tones, and a comfortable groove, perfect for a slow dance or a night in. Instruments: electric guitar, bass, drums.	A description of the music you want to generate.
text_chords	string	C G A:min F	A text based chord progression condition. Single uppercase alphabet character(eg. `C`) is considered as a major chord. Chord attributes like(`maj`, `min`, `dim`, `aug`, `min6`, `maj6`, `min7`, `minmaj7`, `maj7`, `7`, `dim7`, `hdim7`, `sus2` and `sus4`) can be added to the root alphabet character after `:`.(eg. `A:min7`) Each chord token splitted by `SPACE` is allocated to a single bar. If more than one chord must be allocated to a single bar, cluster the chords adding with `,` without any `SPACE`.(eg. `C,C:7 G, E:min A:min`) You must choose either only one of `audio_chords` below or `text_chords`.
bpm	number	120	BPM condition for the generated output. `text_chords` will be processed based on this value. This will be appended at the end of `prompt`.
time_sig	string	4/4	Meter value for the generate output. `text_chords` will be processed based on this value. This will be appended at the end of `prompt`.
audio_chords	string		An audio file that will condition the chord progression. You must choose only one among `audio_chords` or `text_chords` above.
audio_start	integer	0	Start time of the audio file to use for chord conditioning.
audio_end	integer		End time of the audio file to use for chord conditioning. If -1 or None, will default to the end of the audio clip.
duration	integer	30	Duration of the generated audio in seconds.
multi_band_diffusion	boolean	False	If `True`, the EnCodec tokens will be decoded with MultiBand Diffusion. Not compatible with stereo models.
normalization_strategy	None	peak	Strategy for normalizing audio.
top_k	integer	250	Reduces sampling to the k most likely tokens.
top_p	number	0	Reduces sampling to tokens with cumulative probability of p. When set to `0` (default), top_k sampling is used.
temperature	number	1	Controls the 'conservativeness' of the sampling process. Higher temperature means more diversity.
classifier_free_guidance	integer	3	Increases the influence of inputs on the output. Higher values produce lower-varience outputs that adhere more closely to inputs.
output_format	None	wav	Output format for generated audio.
seed	integer		Seed for random number generator. If `None` or `-1`, a random seed will be used.

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema

{'format': 'uri', 'title': 'Output', 'type': 'string'}