ryan5453/demucs:b26a4313 – Run with an API on Replicate

Version

You're looking at a specific version of this model. Jump to the model overview.

ryan5453 /demucs:b26a4313

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field	Type	Default value	Description
audio	string		Upload the file to be processed here.
model	string (enum)	htdemucs Options: htdemucs, htdemucs_ft, htdemucs_6s, hdemucs_mmi, mdx, mdx_q, mdx_extra, mdx_extra_q	Choose the demucs audio that proccesses your audio. Options: htdemucs (first version of hybrid transformer demucs), htdemucs_ft (fine-tuned version of htdemucs, separation will take 4 times longer but may be a bit better), htdemucs_6s (adds piano and guitar sources to htdemucs), hdemucs_mmi (hybrid demucs v3), mdx (trained on exclusively MusDB HQ), mdx_q (quantized version of mdx, slightly faster but worse quality), mdx_extra (adds extra training data to mdx, this is what the cog previously used by default), mdx_extra_q (quantized version of mdx_extra, slightly faster but worse quality)
two_stems	string (enum)	Options: drums, bass, other, vocals, guitar, piano	If you just want to isolate one stem, you can choose it here. This does not improve performance, as it just combines all of the stems that you did not choose.
output_format	string (enum)	mp3 Options: mp3, aac, flac, wav	Choose the audio format you would like the result to be returned in.
split	boolean	True	Choose whether or not the audio should be split into chunks.
segment	integer		Choose the size of each chunk. This only has an effect if split is set to true.
clip_mode	string (enum)	rescale Options: rescale, clamp	Choose the strategy for avoiding clipping. Rescaling adjusts the overall scale of a signal to prevent any clipping, while hard clipping limits the signal to a maximum range, distorting parts of the signal that exceed that range.
shifts	integer	1	Choose the amount random shifts for equivariant stabilization. This performs multiple predictions with random shifts of the input and averages them, which makes it x times slower.
overlap	number	0.25	Choose the amount of overlap between prediction windows.

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema

{'properties': {'stems': {'items': {'properties': {'audio': {'format': 'uri',
                                                             'title': 'Audio',
                                                             'type': 'string'},
                                                   'name': {'title': 'Name',
                                                            'type': 'string'}},
                                    'required': ['name', 'audio'],
                                    'title': 'DemucsStem',
                                    'type': 'object'},
                          'title': 'Stems',
                          'type': 'array'}},
 'required': ['stems'],
 'title': 'DemucsResponse',
 'type': 'object'}