Run time and cost

This model runs on NVIDIA A100 (40GB) GPU hardware. Predictions typically complete within 45 seconds, though predict time varies significantly with the inputs.


A watermark-free Modelscope-based video model optimized for producing high-quality 16:9 compositions and smooth video output. This model was trained using 9,923 clips and 29,769 tagged frames at 24 frames, 576x320 resolution. zeroscope_v2_576w is specifically designed for upscaling with zeroscope_v2_XL using vid2vid.
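
As a quick sanity check on the training figures quoted above, the short sketch below works out how many tagged frames there are per clip and what aspect ratio 576x320 reduces to. It assumes the quoted numbers are exact; the helper names are illustrative and not part of the model or any library.

```python
from fractions import Fraction

# Figures quoted in the model description.
CLIPS = 9_923
TAGGED_FRAMES = 29_769
WIDTH, HEIGHT = 576, 320


def tagged_frames_per_clip(frames: int, clips: int) -> Fraction:
    """Average number of tagged frames per training clip (hypothetical helper)."""
    return Fraction(frames, clips)


def aspect_ratio(width: int, height: int) -> Fraction:
    """Width-to-height ratio reduced to lowest terms (hypothetical helper)."""
    return Fraction(width, height)


per_clip = tagged_frames_per_clip(TAGGED_FRAMES, CLIPS)
ratio = aspect_ratio(WIDTH, HEIGHT)

print(f"tagged frames per clip: {per_clip}")
print(
    f"aspect ratio: {ratio.numerator}:{ratio.denominator} "
    f"({float(ratio):.3f}); 16:9 is {16 / 9:.3f}"
)
```

With these numbers the dataset works out to exactly 3 tagged frames per clip, and 576x320 reduces to 9:5 (1.800), which is close to but slightly wider than a strict 16:9 (about 1.778).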

By cerspense

Thanks to camenduru, kabachuha, ExponentialML, dotsimulate, VANYA, polyware, tin2tin

