synthesys-ai / synthesys-sonic

  • Public
  • 7.3K runs
  • H100
Iterate in playground

Input

*file

Path to the input image

*file

Path to the input audio file

*string

An enumeration.

number

Dynamic scaling factor for processing

Default: 1

number

Audio guidance scaling factor for processing

Default: 7.5

boolean

Crop the image around the detected face

Default: false

integer

Random seed for reproducibility

integer

Minimum resolution for output video

Default: 512

integer

Number of inference steps to process

Default: 25

Output

No output yet! Press "Submit" to start a prediction.

Run time and cost

This model costs approximately $0.60 to run on Replicate, or 1 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia H100 GPU hardware. Predictions typically complete within 7 minutes. The predict time for this model varies significantly based on the inputs.

Readme

This model doesn't have a readme.