Real-Time High Quality Lip Synchronization with Latent Space Inpainting
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation.
audio to srt
This model doesn't have a readme.
This model is not yet booted but ready for API calls. Your first API call will boot the model and may take longer, but after that subsequent responses will be fast.
This model runs on L40S.