Given a brief prompt minor context and a starting image, create a script video and voice in one go.
This model doesn't have a readme.