Readme
Wan 2.7 Image
Generate and edit images using Alibaba’s Wan 2.7 model. This is the standard speed variant — for higher quality output with 4K support and thinking mode, see wan-video/wan-2.7-image-pro.
What it can do
Text-to-image — Describe what you want and get a high-quality image. Supports resolutions up to 2K (2048×2048) with flexible aspect ratios.
Image editing — Provide up to 9 reference images along with a text prompt to edit, restyle, or fuse images together. The model can apply style transfer, swap elements between images, and blend multiple references into a single output.
Image set generation — Create a coherent set of related images from a single prompt. Useful for generating the same character across different scenes (e.g. a cat through four seasons), product shots from different angles, or storyboard sequences. Generates up to 12 images per request.
Inputs
- prompt — Text description of what you want to generate or how to edit the input images. Supports up to 5,000 characters.
- images — Optional input images for editing, style transfer, or multi-reference generation. Up to 9 images.
- size — Output resolution:
1K(~1024×1024),2K(~2048×2048), or custom dimensions like1920*1080. - num_outputs — Number of images to generate (1-4, or 1-12 in image set mode).
- image_set_mode — Enable coherent image set generation.
- thinking_mode — Enhanced reasoning for improved quality (text-to-image only). On by default.
- seed — For reproducible results.
Tips
- For image editing, put the editing instruction in the prompt and pass the source image(s) in the
imagesinput. - For multi-image fusion, pass multiple images and describe how to combine them.
- Image set mode works best with structured prompts that describe each image in the set (e.g. “First image: … Second image: …”).
- The
1Kand2Ksize presets automatically maintain aspect ratio when input images are provided.