Official

bytedance / seedance-1-lite

A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution

  • Public
  • 100.4K runs
  • Commercial use
Iterate in playground

Input

*string
Shift + Return to add a new line

Text prompt for video generation

file

Input image for image-to-video generation

file

Input image for last frame generation. This only works if an image start frame is given too.

integer

Video duration in seconds

Default: 5

string

Video resolution

Default: "720p"

string

Video aspect ratio. Ignored if an image is used.

Default: "16:9"

integer

Frame rate (frames per second)

Default: 24

boolean

Whether to fix camera position

Default: false

integer

Random seed. Set for reproducible generation

Output

Generated in

Pricing

Model pricing for bytedance/seedance-1-lite. Looking for volume pricing? Get in touch.

Official models are always on, maintained, and have predictable pricing. Learn more.

Check out our docs for more information about how pricing works on Replicate.

When

target resolution is 480p
$0.018

per second of output video

or around 55 seconds for $1

When

target resolution is 720p
$0.036

per second of output video

or around 27 seconds for $1

Readme

Seedance 1.0

A video generation model that creates videos from text prompts and images.

Core Capabilities

Video Generation

  • Text-to-Video (T2V): Generate videos from text descriptions
  • Image-to-Video (I2V): Generate videos from static images with optional text prompts
  • Resolution: Outputs 1080p videos

Motion and Dynamics

  • Wide dynamic range supporting both subtle and large-scale movements
  • Maintains physical realism and stability across motion sequences
  • Handles complex action sequences and multi-agent interactions

Multi-Shot Support

  • Native multi-shot video generation with narrative coherence
  • Maintains consistency in subjects, visual style, and atmosphere across shot transitions
  • Handles temporal and spatial shifts between scenes

Style and Aesthetics

  • Supports diverse visual styles: photorealism, cyberpunk, illustration, felt texture, and others
  • Interprets stylistic prompts accurately
  • Maintains cinematic quality with rich visual details

Prompt Understanding

  • Parses natural language descriptions effectively
  • Stable control over camera movements and positioning
  • Accurate interpretation of complex scene descriptions
  • Strong prompt adherence across generated content

Technical Specifications

  • Model Version: 1.0
  • Output Resolution: 1080p
  • Input Types: Text prompts, images (for I2V mode)
  • Video Length: Multi-shot capable (specific duration limits not specified)

Model Performance

Based on internal benchmarks using SeedVideoBench-1.0 and third-party evaluations:

  • High scores in prompt adherence
  • Strong motion quality ratings
  • Competitive aesthetic quality
  • Effective source image consistency in I2V tasks

Use Cases

  • Creative video content generation
  • Prototype development for film and animation
  • Commercial video production
  • Educational and documentary content
  • Fantasy and surreal video creation

Limitations

  • Performance metrics based on specific benchmark datasets
  • Actual generation quality may vary with prompt complexity
  • Multi-shot consistency depends on prompt clarity and scene descriptions