Wan 2.7 Image

Generate and edit images using Alibaba’s Wan 2.7 model. This is the standard speed variant — for higher quality output with 4K support and thinking mode, see wan-video/wan-2.7-image-pro.

What it can do

Text-to-image — Describe what you want and get a high-quality image. Supports resolutions up to 2K (2048×2048) with flexible aspect ratios.

Image editing — Provide up to 9 reference images along with a text prompt to edit, restyle, or fuse images together. The model can apply style transfer, swap elements between images, and blend multiple references into a single output.

Image set generation — Create a coherent set of related images from a single prompt. Useful for generating the same character across different scenes (e.g. a cat through four seasons), product shots from different angles, or storyboard sequences. Generates up to 12 images per request.

Inputs

prompt — Text description of what you want to generate or how to edit the input images. Supports up to 5,000 characters.
images — Optional input images for editing, style transfer, or multi-reference generation. Up to 9 images.
size — Output resolution: 1K (~1024×1024), 2K (~2048×2048), or custom dimensions like 1920*1080.
num_outputs — Number of images to generate (1-4, or 1-12 in image set mode).
image_set_mode — Enable coherent image set generation. When enabled, num_outputs can go up to 12.
thinking_mode — Enhanced reasoning for improved quality (text-to-image only). On by default.
seed — Random seed. Set a fixed value for reproducible results.
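
The limits listed above can be checked before sending a request. The following is a minimal sketch, not an official client: `build_input` is a hypothetical helper, the default `size` of "1K" is an assumption, and the returned dictionary simply mirrors the input names documented here. It only enforces the limits stated in this README (prompt length, image count, size format, and output count).

```python
import re

MAX_PROMPT_CHARS = 5000   # documented prompt limit
MAX_IMAGES = 9            # documented reference image limit
# Accepts the presets "1K" and "2K", or custom dimensions such as "1920*1080".
SIZE_PATTERN = re.compile(r"^(1K|2K|\d+\*\d+)$")


def build_input(prompt, images=None, size="1K", num_outputs=1,
                image_set_mode=False, thinking_mode=True, seed=None):
    """Hypothetical helper: validate arguments and return an input dict."""
    images = list(images or [])
    if len(prompt) > MAX_PROMPT_CHARS:
        raise ValueError(f"prompt exceeds {MAX_PROMPT_CHARS} characters")
    if len(images) > MAX_IMAGES:
        raise ValueError(f"at most {MAX_IMAGES} input images are supported")
    if not SIZE_PATTERN.match(size):
        raise ValueError("size must be '1K', '2K', or 'WIDTH*HEIGHT'")

    # 1-4 outputs normally, 1-12 when image set mode is enabled.
    max_outputs = 12 if image_set_mode else 4
    if not 1 <= num_outputs <= max_outputs:
        raise ValueError(f"num_outputs must be between 1 and {max_outputs}")

    payload = {
        "prompt": prompt,
        "size": size,
        "num_outputs": num_outputs,
        "image_set_mode": image_set_mode,
        # Documented as applying to text-to-image only; passed through as given.
        "thinking_mode": thinking_mode,
    }
    if images:
        payload["images"] = images
    if seed is not None:
        payload["seed"] = seed
    return payload
```

For example, `build_input("a red fox in snow", size="1920*1080", seed=7)` returns a dictionary with a fixed seed and custom dimensions, while `build_input("x", num_outputs=5)` raises `ValueError` because five outputs are only allowed in image set mode.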

Tips

For image editing, put the editing instruction in the prompt and pass the source image(s) in the images input.
For multi-image fusion, pass multiple images and describe how to combine them.
Image set mode works best with structured prompts that describe each image in the set (e.g. “First image: … Second image: …”).
The 1K and 2K size presets automatically maintain aspect ratio when input images are provided.
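
As an illustration of the structured prompt tip for image set mode, here is a small sketch that assembles a "First image: … Second image: …" prompt from a list of scene descriptions. The helper name `build_set_prompt` and the optional `shared_context` argument are hypothetical and not part of the model's interface; the exact wording the model responds to best may differ.

```python
# Ordinals for up to 12 images, the documented maximum per request.
ORDINALS = [
    "First", "Second", "Third", "Fourth", "Fifth", "Sixth",
    "Seventh", "Eighth", "Ninth", "Tenth", "Eleventh", "Twelfth",
]


def build_set_prompt(scenes, shared_context=""):
    """Hypothetical helper: join scene descriptions into one structured prompt."""
    if not 1 <= len(scenes) <= len(ORDINALS):
        raise ValueError(f"expected between 1 and {len(ORDINALS)} scenes")
    parts = [
        # Normalize each scene so it ends with exactly one period.
        f"{ORDINALS[i]} image: {scene.strip().rstrip('.')}."
        for i, scene in enumerate(scenes)
    ]
    body = " ".join(parts)
    # Prepend shared context (e.g. a character description) when provided.
    return f"{shared_context.strip()} {body}".strip()


prompt = build_set_prompt(
    ["a cat in spring blossoms", "a cat on a summer beach"],
    shared_context="The same orange tabby cat appears in every image.",
)
```

The resulting string would go in the prompt input, with image_set_mode enabled and num_outputs set to the number of scenes.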

Model created 3 months, 1 week ago