You're looking at a specific version of this model. Jump to the model overview.
bytedance /omni-human-1.5:1060b0f4
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.
| Field | Type | Default value | Description |
|---|---|---|---|
| seed |
integer
|
Random seed for reproducible generation.
|
|
| audio |
string
|
Input audio file (MP3, WAV, etc.). Duration must be less than 35 seconds. If the audio exceeds 35 seconds, an error will be generated and the generation will fail.
|
|
| image |
string
|
Input image containing a human subject, face or character.
|
|
| prompt |
string
|
Optional prompt for precise control of the scene, movements, camera movements, etc. Supports Chinese, English, Japanese, Korean, Spanish, and Indonesian.
|
|
| fast_mode |
boolean
|
False
|
Enable fast mode to speed up generation by sacrificing some effects.
|
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}