You're looking at a specific version of this model. Jump to the model overview.
tiger-ai-lab /anyv2v:30adf8ca
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.
Field | Type | Default value | Description |
---|---|---|---|
video |
string
|
Input video
|
|
instruct_pix2pix_prompt |
string
|
turn man into robot
|
The first step invovles using timbrooks/instruct-pix2pix to edit the first frame. Specify the prompt for editing the first frame.
|
editing_prompt |
string
|
a man doing exercises for the body and mind
|
Describe the input video
|
editing_negative_prompt |
string
|
Distorted, discontinuous, Ugly, blurry, low resolution, motionless, static, disfigured, disconnected limbs, Ugly faces, incomplete arms
|
Things not to see int the edited video
|
num_inference_steps |
integer
|
50
Min: 1 Max: 500 |
Number of denoising steps
|
guidance_scale |
number
|
9
Min: 1 Max: 20 |
Scale for classifier-free guidance
|
pnp_f_t |
number
|
1
Max: 1 |
Specifies the proportion of time steps in the DDIM sampling process where the convolutional injection is applied. A higher value improves motion consistency. 1.0 indicates injection at every time step
|
pnp_spatial_attn_t |
number
|
1
Max: 1 |
Specifies the proportion of time steps in the DDIM sampling process where the spatial attention injection is applied. A higher value improves motion consistency. 1.0 indicates injection at every time step
|
pnp_temp_attn_t |
number
|
1
Max: 1 |
Specifies the proportion of time steps in the DDIM sampling process where the temporal attention injection is applied. A higher value improves motion consistency. 1.0 indicates injection at every time step
|
ddim_init_latents_t_idx |
integer
|
0
|
This parameter determines the time step index at which to begin sampling from the initial DDIM inversed latents, with a range of [0, num_inference_steps-1]. In the context of a DDIM sampling process where the sampling step is 50, the scheduler progresses through the time steps in the sequence [981, 961, 941, ..., 1]. Therefore, setting ddim_init_latents_t_idx to 0 initiates the sampling from t=981, whereas setting it to 1 starts the process at t=961. A higher index enhances motion consistency with the source video but may lead to flickering and cause the edited video to diverge from the edited first frame.
|
ddim_inversion_steps |
integer
|
500
|
Number of ddim inversion steps
|
seed |
integer
|
Random seed. Leave blank to randomize the seed
|
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}