You're looking at a specific version of this model. Jump to the model overview.

bytedance /omni-human-1.5:1060b0f4

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
seed
integer
Random seed for reproducible generation.
audio
string
Input audio file (MP3, WAV, etc.). Duration must be less than 35 seconds. If the audio exceeds 35 seconds, an error will be generated and the generation will fail.
image
string
Input image containing a human subject, face or character.
prompt
string
Optional prompt for precise control of the scene, movements, camera movements, etc. Supports Chinese, English, Japanese, Korean, Spanish, and Indonesian.
fast_mode
boolean
False
Enable fast mode to speed up generation by sacrificing some effects.

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}