You're looking at a specific version of this model. Jump to the model overview.
mirelo /video-to-sfx-v1:98ec375b
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.
| Field | Type | Default value | Description |
|---|---|---|---|
| seed |
integer
|
Random seed for reproducibility. Use -1 for random seed, or any integer for deterministic results.
|
|
| steps |
integer
|
25
Min: 1 Max: 30 |
Number of processing steps for the generation model. Higher values may improve quality but take longer.
|
| duration |
integer
|
10
Min: 1 Max: 10 |
Duration of the generated sound effects in seconds.
|
| video_path |
string
|
Video file to process for sound effects. Video will be trimmed to 10 sec if longer
|
|
| num_samples |
integer
|
1
Min: 1 Max: 4 |
Number of sound effects to generate. Each sample will be a different variation.
|
| text_prompt |
string
|
|
Text prompt to guide sound effect generation. Optional text to guide the sound generation process.
|
| start_offset |
number
|
0
Max: 300 |
Starting point in the video (in seconds) from which to generate audio. 0 means start from the beginning.
|
| creativity_coef |
number
|
4.5
Min: 1 Max: 10 |
Creativity coefficient to control the creativity of the generated sound. Higher values are more creative.
|
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{'items': {'format': 'uri', 'type': 'string'},
'title': 'Output',
'type': 'array'}