You're looking at a specific version of this model. Jump to the model overview.

cottom /uvr-api:426932a7

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
audio
string
Audio file to separate (upload)
audio_url
string
URL to audio file (alternative to upload)
model
None
demucs:hdemucs_mmi
Model to use for separation
output_mode
None
vocals_instrumental
Output mode: vocals+instrumental or all stems
output_format
None
mp3
Output audio format
mp3_bitrate
None
320
MP3 bitrate in kbps (only for mp3 format)
vr_aggressiveness
number
0.05

Max: 1

[VR Network] Aggressiveness (0.0-1.0). Higher = more vocal removal
vr_window_size
None
512
[VR Network] Window size. Larger = better quality but slower
vr_batch_size
None
4
[VR Network] Batch size for inference
vr_high_end_process
boolean
False
[VR Network] Enable high-end frequency processing
mdx_segment_size
None
256
[MDX] Segment size. Larger = better quality but more memory
mdx_overlap
number
0.75

Max: 0.99

[MDX] Overlap between segments (0.0-0.99)
mdx_denoise
boolean
False
[MDX] Enable denoising
mdxc_segment_size
None
256
[MDXC] Segment size. Larger = better quality but more memory
mdxc_overlap
None
8
[MDXC] Overlap between segments
mdxc_batch_size
None
1
[MDXC] Batch size for inference

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'title': 'Output', 'type': 'object'}