chenxwh/nova-t2v:efe91027

Input

string

Input prompt

Default: "The camera slowly rotates around a massive stack of vintage televisions that are placed within a large New York museum gallery. Each of the televisions is showing a different program. There are 1950s sci-fi movies with their distinctive visuals, horror movies with their creepy scenes, news broadcasts with moving images and words, static on some screens, and a 1970s sitcom with its characteristic look. The televisions are of various sizes and designs, some with rounded edges and others with more angular shapes. The gallery is well-lit, with light falling on the stack of televisions and highlighting the different programs being shown. There are no people visible in the immediate vicinity, only the stack of televisions and the surrounding gallery space."

string

Things that should not appear in the output

Default: "low quality, deformed, distorted, disfigured, fused fingers, bad anatomy, weird hand"

file

Optional input image prompt

integer
(minimum: 1, maximum: 128)

Number of inference steps

Default: 128

integer
(minimum: 1, maximum: 100)

Number of diffusion steps

Default: 100

number
(minimum: 1, maximum: 10)

Scale for classifier-free guidance

Default: 7

integer
(minimum: 1, maximum: 10)

Motion Flow

Default: 5

integer

Random seed. Leave blank to use a random seed.

integer

Frames per second (fps) of the output video

Default: 12
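
The ranges and defaults listed above can be checked before a prediction is submitted. The following is a minimal sketch only: the page does not show the actual field names, so every key used here (`prompt`, `negative_prompt`, `num_inference_steps`, `num_diffusion_steps`, `guidance_scale`, `motion_flow`, `seed`, `fps`) is a hypothetical name chosen for illustration, and `build_input` is not part of any real client library.

```python
from typing import Any, Dict, Optional

# Ranges and defaults copied from the input list above.
# NOTE: the key names are assumptions; the page does not show the real field names.
RANGES = {
    "num_inference_steps": (1, 128),
    "num_diffusion_steps": (1, 100),
    "guidance_scale": (1, 10),
    "motion_flow": (1, 10),
}
DEFAULTS = {
    "num_inference_steps": 128,
    "num_diffusion_steps": 100,
    "guidance_scale": 7,
    "motion_flow": 5,
    "fps": 12,
}
# Fields the page lists as "integer" (guidance scale is listed as "number").
INTEGER_FIELDS = {"num_inference_steps", "num_diffusion_steps", "motion_flow", "fps"}


def build_input(prompt: str, seed: Optional[int] = None, **overrides: Any) -> Dict[str, Any]:
    """Merge overrides into the documented defaults and validate them."""
    unknown = set(overrides) - set(DEFAULTS) - {"negative_prompt"}
    if unknown:
        raise KeyError(f"unknown fields: {sorted(unknown)}")

    payload: Dict[str, Any] = {"prompt": prompt, **DEFAULTS, **overrides}

    for name in INTEGER_FIELDS:
        if not isinstance(payload[name], int):
            raise TypeError(f"{name} must be an integer")

    for name, (low, high) in RANGES.items():
        if not low <= payload[name] <= high:
            raise ValueError(f"{name}={payload[name]} is outside [{low}, {high}]")

    # Leaving the seed out corresponds to leaving the field blank on the form.
    if seed is not None:
        payload["seed"] = seed
    return payload


if __name__ == "__main__":
    print(build_input("A stack of vintage televisions", guidance_scale=3.5, seed=42))
```

The returned dictionary could then serve as the input payload for whatever client submits the prediction, once the hypothetical keys are replaced with the real field names.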

Output
