You're looking at a specific version of this model.

cottom/uvr-api:fabdd202

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field, its default value is used.

| Field | Type | Default value | Description |
| --- | --- | --- | --- |
| audio | string | | Audio file to separate (upload) |
| audio_url | string | | URL to audio file (alternative to upload) |
| model | None | demucs:hdemucs_mmi | Model to use for separation |
| output_format | None | mp3 | Output audio format |
| mp3_bitrate | None | 320 | MP3 bitrate in kbps |
| vr_aggressiveness | number | 0.05 | [VR Network] Aggressiveness of separation (0.0-1.0). Max: 1 |
| vr_window_size | None | 512 | [VR Network] Window size for processing |
| vr_batch_size | None | 4 | [VR Network] Batch size for inference |
| vr_high_end_process | boolean | False | [VR Network] Enable high-end frequency processing |
| mdx_segment_size | None | 256 | [MDX] Segment size for processing |
| mdx_overlap | number | 0.75 | [MDX] Overlap between segments (0.0-0.99). Max: 0.99 |
| mdx_denoise | boolean | False | [MDX] Enable denoising |
| mdxc_segment_size | None | 256 | [MDXC] Segment size for processing |
| mdxc_overlap | None | 8 | [MDXC] Overlap between segments |
| mdxc_batch_size | None | 1 | [MDXC] Batch size for inference |
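
As a rough illustration of how the fields above fit together, the sketch below builds an input payload in Python. The helper names `build_input` and `effective_settings` are hypothetical and not part of this model or of any client library. Two assumptions are baked in: that exactly one of `audio` and `audio_url` should be supplied (the schema only says one is an alternative to the other), and that the lower bound of the two ranged fields is 0.0, as their descriptions suggest. Fields you leave out are simply not sent, so the defaults listed above apply.

```python
# Defaults copied from the input schema above.
DEFAULTS = {
    "model": "demucs:hdemucs_mmi",
    "output_format": "mp3",
    "mp3_bitrate": 320,
    "vr_aggressiveness": 0.05,
    "vr_window_size": 512,
    "vr_batch_size": 4,
    "vr_high_end_process": False,
    "mdx_segment_size": 256,
    "mdx_overlap": 0.75,
    "mdx_denoise": False,
    "mdxc_segment_size": 256,
    "mdxc_overlap": 8,
    "mdxc_batch_size": 1,
}

# Upper bounds come from the schema ("Max: 1", "Max: 0.99").
# The 0.0 lower bounds are an assumption based on the field descriptions.
RANGES = {
    "vr_aggressiveness": (0.0, 1.0),
    "mdx_overlap": (0.0, 0.99),
}


def build_input(audio_url=None, audio=None, **overrides):
    """Hypothetical helper: return a payload holding only the fields to send."""
    # Assumption: exactly one audio source is expected.
    if (audio is None) == (audio_url is None):
        raise ValueError("provide exactly one of audio or audio_url")
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")
    for name, (low, high) in RANGES.items():
        if name in overrides and not low <= overrides[name] <= high:
            raise ValueError(f"{name} must be between {low} and {high}")
    payload = {"audio_url": audio_url} if audio_url is not None else {"audio": audio}
    payload.update(overrides)
    return payload


def effective_settings(payload):
    """Show the values that would apply once omitted fields take their defaults."""
    return {**DEFAULTS, **payload}


# Example: override a single MDX setting and leave everything else at its default.
payload = build_input(audio_url="https://example.com/song.wav", mdx_overlap=0.5)
settings = effective_settings(payload)
```

With the `replicate` Python client, a payload like this would typically be passed as the `input` argument of `replicate.run`, together with the model reference shown at the top of this page.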

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{"title": "Output", "type": "object"}
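
Because the schema declares only a generic object with no named properties, client code cannot rely on specific keys. The sketch below is one defensive way to handle such a response in Python. It assumes, without confirmation from the schema, that the object maps names to values and that some of those values are file URLs. The helper `extract_urls` and the key names in the example are hypothetical.

```python
from urllib.parse import urlparse


def extract_urls(output):
    """Hypothetical helper: keep only the entries whose value looks like an HTTP(S) URL."""
    if not isinstance(output, dict):
        raise TypeError(f"expected an object (dict), got {type(output).__name__}")
    urls = {}
    for key, value in output.items():
        text = str(value)
        # Anything without an http/https scheme is skipped rather than guessed at.
        if urlparse(text).scheme in ("http", "https"):
            urls[key] = text
    return urls


# Made-up response used purely for illustration; real key names are not documented.
sample = {"vocals": "https://example.com/vocals.mp3", "note": "done"}
files = extract_urls(sample)
```

Inspecting the keys of a real response once is the more reliable way to learn which entries the model returns.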