Readme
Grok Imagine Video Extension
Extend any video with a natural continuation. Give this model a source video and describe what should happen next — it generates a seamless extension from the last frame of the input.
What it does
Video extension takes an existing video and generates new footage that picks up where it left off. The model reads the last frame of your input video and produces a continuation that matches the visual style, motion, and content. You control what happens next with a text prompt.
This is useful for:
- Lengthening clips — turn a 5-second video into an 11-second video
- Adding narrative beats — describe the next action in a scene
- Iterative generation — extend a video multiple times to build longer sequences
- Creative direction — steer the continuation with specific camera moves, actions, or mood changes
How to use it
Provide a source video and a prompt describing what should happen next:
- prompt (required) — text description of what should happen in the extension
- video (required) — source video to extend (MP4 format, H.264/H.265/AV1, 2-15 seconds)
- duration (optional) — length of the extension in seconds, from 2 to 10 (default: 6)
The output is the complete extended video — the original footage plus the generated continuation stitched together.
Prompt tips
Since the model already has visual context from your input video, focus your prompt on what should change:
- Describe the action, not the scene: “The person turns and walks away” rather than “A person standing in a park”
- Mention camera movement: “Slow zoom out to reveal the full landscape” or “Camera pans left to follow the subject”
- Be specific about motion: “The bird takes off with a powerful wingbeat” is better than “The bird flies”
Input video requirements
- Format: MP4 (H.264, H.265, or AV1 codec)
- Duration: 2 to 15 seconds
- Quality: Higher resolution input produces better extensions. 720p works well.
Technical details
- Extension duration: 2-10 seconds (default 6)
- Output: The original video plus the extension, stitched together as a single MP4
- Model: Powered by xAI’s Grok Imagine Video
- Pricing: Billed per second of output video