You're looking at a specific version of this model. Jump to the model overview.
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.
| Field | Type | Default value | Description |
|---|---|---|---|
| audio |
string
|
Audio file to separate (upload)
|
|
| audio_url |
string
|
URL to audio file (alternative to upload)
|
|
| model |
None
|
demucs:hdemucs_mmi
|
Model to use for separation
|
| output_mode |
None
|
vocals_instrumental
|
Output mode: vocals+instrumental or all stems
|
| output_format |
None
|
mp3
|
Output audio format
|
| vr_aggressiveness |
number
|
0.05
Max: 1 |
[VR Network] Aggressiveness (0.0-1.0). Higher = more vocal removal
|
| vr_window_size |
None
|
512
|
[VR Network] Window size. Larger = better quality but slower
|
| vr_batch_size |
None
|
4
|
[VR Network] Batch size for inference
|
| vr_high_end_process |
boolean
|
False
|
[VR Network] Enable high-end frequency processing
|
| mdx_segment_size |
None
|
256
|
[MDX] Segment size. Larger = better quality but more memory
|
| mdx_overlap |
number
|
0.75
Max: 0.99 |
[MDX] Overlap between segments (0.0-0.99)
|
| mdx_denoise |
boolean
|
False
|
[MDX] Enable denoising
|
| mdxc_segment_size |
None
|
256
|
[MDXC] Segment size. Larger = better quality but more memory
|
| mdxc_overlap |
None
|
8
|
[MDXC] Overlap between segments
|
| mdxc_batch_size |
None
|
1
|
[MDXC] Batch size for inference
|
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{'title': 'Output', 'type': 'object'}