You're looking at a specific version of this model. Jump to the model overview.

zsxkib /memo:be54f93b

Input

*file

Input image (e.g. PNG/JPG).

*file

Input audio (e.g. WAV/MP3).

integer
(minimum: 1, maximum: 200)

Diffusion inference steps

Default: 20

number
(minimum: 1, maximum: 20)

Classifier-free guidance scale

Default: 3.5

integer

Set a random seed (None or 0 for random)

Including resolution and 3 more...

Output

No output yet! Press "Submit" to start a prediction.