atonamy/wan-alpha

Wan-Alpha text-to-video generation with alpha channel, optimized for Replicate deployment. WebM/WebP output with transparency support.

Public
126 runs

Run time and cost

This model runs on Nvidia A100 (80GB) GPU hardware. We don't yet have enough runs of this model to provide performance information.

Readme

Wan-Alpha - Text-to-Video with Alpha Channel

Generate high-quality videos with transparent backgrounds using AI. Perfect for creating animations, visual effects, and compositing.

What is this?

Wan-Alpha generates videos from text descriptions with full alpha channel support - meaning transparent backgrounds you can overlay on anything.

Example: - Input: “A colorful parrot flying. Transparent background. Close-up shot. Realistic style.” - Output: WebM/WebP video with the parrot on a transparent background

Features

  • Transparent backgrounds - True alpha channel support
  • 🎬 WebM/WebP output - Industry-standard formats with transparency
  • 📐 Multiple resolutions - Including custom 512x512 square format
  • Adjustable FPS - 1-60 frames per second
  • 💎 Quality control - Fine-tune output quality (1-100)

How to Use

Basic Usage

  import replicate

  output = replicate.run(
      "atonamy/wan-alpha",
      input={
          "prompt": "A butterfly flying through flowers. Transparent background. Medium shot.
  Realistic style.",
          "output_format": "webm",
          "resolution": "832*480",
          "num_frames": 81,
          "fps": 16,
          "output_quality": 85
      }
  )

print(output)  # URL to your generated video

Parameters

Required: - prompt (string) - Describe what you want to see. Important: Include “transparent background” and specify shot type (close-up, medium shot, wide shot)

Optional: - output_format (string) - webm or webp (default: webm) - resolution (string) - Options: - 480832 - Portrait - 832480 - Landscape (default) - 7201280 - Portrait HD - 1280720 - Landscape HD - 512512 (fit vertical) - Square, portrait-oriented - 512512 (fit horizontal) - Square, landscape-oriented - num_frames (int) - Number of frames, must be 4n+1 (e.g., 33, 49, 81). Default: 81 (~5 seconds at 16fps) - fps (int) - Frame rate 1-60. Default: 16 - output_quality (int) - Quality 1-100. Default: 85 - sample_steps (int) - Diffusion steps. Default: 4 - guidance_scale (float) - CFG scale. Default: 1.0 - seed (int) - Random seed. -1 for random. Default: -1 - negative_prompt (string) - What to avoid in the generation

Prompt Writing Tips

Good prompts include: 1. “Transparent background” - Essential for transparency 2. Shot type - “Close-up shot”, “Medium shot”, “Wide shot” 3. Subject description - What you want to see 4. Style - “Realistic style”, “Animated style”, etc.

Example prompts:

“A golden dragon flying. Transparent background. Wide shot. Realistic style.”

“Glowing particles floating upward. Transparent background. Close-up. Magical style.”

“A person waving hello. Transparent background. Medium shot. Realistic style.”

Advanced Example

  import replicate

  # Generate high-quality square video
  output = replicate.run(
      "atonamy/wan-alpha",
      input={
          "prompt": "Colorful smoke swirling. Transparent background. Close-up shot. Realistic 
  style.",
          "output_format": "webm",
          "resolution": "512*512 (fit vertical)",
          "num_frames": 49,  # ~3 seconds
          "fps": 24,  # Smoother motion
          "output_quality": 95,  # Higher quality
          "seed": 42  # Reproducible results
      }
  )

Output

  • Format: WebM or WebP file
  • Alpha channel: Full transparency support
  • Duration: Depends on num_frames and fps
    • 81 frames @ 16fps = ~5 seconds
    • 49 frames @ 24fps = ~2 seconds

Use Cases

  • 🎥 Video compositing and VFX
  • 🎮 Game assets and animations
  • 📱 AR filters and effects
  • 🎨 Motion graphics and overlays
  • 📺 Broadcast graphics
  • 🌐 Web animations

Technical Details

  • Model: Wan2.1-T2V-14B with alpha channel adaptation
  • Base resolution: Renders at native supported sizes, then optionally resizes
  • 512x512 formats: Rendered at higher resolution (480x832 or 832x480) then scaled down with transparent padding for best quality

Limitations

  • Requires “transparent background” in prompt for best results
  • Best with simple subjects on transparent backgrounds
  • Complex scenes may have edge artifacts
  • Frame count must follow 4n+1 pattern (33, 49, 81, etc.)

Credits

Based on https://github.com/WeChatCV/Wan-Alpha by the WeChatCV team.

Model created