INTRODUCING WAN V2.6 TEXT TO IMAGE

WAN V2.6 TEXT TO IMAGE

EVOLUTION OF IMAGE GENERATION

Flexible multilingual image generation model

Example 1
Example 2
Example 3
Example 4
Example 5
Example 6
Example 7
Example 8
Example 9
Example 10
Example 11
Example 12
Example 1
Example 2
Example 3
Example 4
Example 5
Example 6
Example 7
Example 8
Example 9
Example 10
Example 11
Example 12
EDITORIAL FASHION PORTRAIT

EDITORIAL FASHION PORTRAIT

LIFESTYLE BRAND CAMPAIGN

LIFESTYLE BRAND CAMPAIGN

ARTISTIC PORTRAITURE

ARTISTIC PORTRAITURE

Wan v2.6 Text to Image is a text-to-image generation model available on fal.ai, designed to convert descriptive prompts into high-quality images. The model accommodates both text and optional reference images as input, enabling users to guide style and content more precisely. A key feature of Wan v2.6 is its support for prompts in both English and Chinese, making it accessible to a broader user base.

At its core, Wan v2.6 is engineered for versatility in image creation based on detailed user instructions. Users can enter a text prompt up to 2000 characters, describing the desired scene, object, or concept. The model can refine output further when provided with a reference image (via URL), which serves as a visual guide—useful for maintaining consistent style or incorporating specific elements. The reference image should be in JPEG, JPG, PNG (without alpha), BMP, or WEBP format, with each dimension between 384 and 5000 pixels and a maximum size of 10MB. Only one reference image can be supplied per request.

Flexible image sizing is a notable technical capability. Users can specify exact dimensions (height and width, from 1 pixel up to 14,142 pixels) or select from common aspect ratio presets such as 'square_hd', 'square', 'portrait_4_3', 'portrait_16_9', 'landscape_4_3', or 'landscape_16_9'. If no size is specified, the output will match the provided input image size or default to a maximum of 1280x1280 pixels.

To enhance creative control, Wan v2.6 supports a 'negative prompt' field, up to 500 characters, which lets users describe qualities or content they wish to avoid in the generated image (for example: "low resolution, error, worst quality, low quality, deformed"). The model also allows users to set a seed (integer between 0 and 2,147,483,647) for reproducibility, so the same input can yield identical results in the future.

For users requiring multiple variations, the 'max_images' parameter can be set to produce between 1 and 5 images per request, although the actual number may vary depending on model inference.

Safety and responsible use are considered, with an 'enable_safety_checker' parameter (default: true) for content moderation applied to both input and output.

The output consists of generated images in PNG format and, in certain mixed modes, can also include generated text (if enabled). The typical use case is generating new images from textual descriptions, with optional guidance from a reference image. The product supports commercial use and can be accessed interactively via a playground, or programmatically through a documented API schema.

Wan v2.6's configuration options and multilingual support position it well for a variety of creative, design, or commercial image generation tasks requiring control over content, style, output dimensions, and quality. No explicit performance, quality metrics, or unique technical limitations are documented beyond the parameter constraints and safety features outlined above.

Jana dengan model imej paling maju

A woman kneeling in darkness, illuminated by a warm, radiant beam of light emerging from her raised hand.

Langkah 1

Tulis senario anda

Taip arahan yang menerangkan imej diingini dengan butiran gaya, pencahayaan dan komposisi

Langkah 2

AI menjana

Model memahami fizik, pencahayaan dan niat emosi adegan anda

Langkah 3

Mulakan berkongsi

Klik untuk menjana output akhir anda dan muat turun imej gred pengeluaran

Melampaui arahan: Tahap kawalan baru

CINEMATIC SCENE CREATION

CINEMATIC SCENE CREATION

Displays the model’s ability to create cinematic, wide-angle visuals with atmospheric lighting and a trendy filmic look, perfect for storytelling.

CINEMATIC SCENE CREATION
GROUP LIFESTYLE IMAGERY

GROUP LIFESTYLE IMAGERY

Illustrates the generation of lively, aspirational scenes featuring multiple people with precise gender and styling—ideal for lifestyle branding in a modern context.

GROUP LIFESTYLE IMAGERY
ASPIRATIONAL ARCHITECTURAL IMAGE

ASPIRATIONAL ARCHITECTURAL IMAGE

Highlights how the model renders architectural complexity, atmospheric light, and photorealistic details—enhancing modern, aspirational visual storytelling.

ASPIRATIONAL ARCHITECTURAL IMAGE

Banding dengan model serupa

High-end studio product photography of premium wireless over-ear headphones in matte black finish. Dramatic three-point lighting with soft key light from upper left, rim light highlighting the ear cup contours, and subtle fill. Clean white seamless backdrop with soft gradient. Sharp focus on texture details of the leather headband and brushed metal accents. Professional advertising quality, 8K resolution, photorealistic rendering.

Featured example 1
Penantian akhirnya berakhir

Rasai kesempurnaan dengan Wan v2.6 Text to Image

Tukar kepada sintesis berpandukan penalaran hari ini

Soalan Lazim

The model accepts text prompts in English or Chinese (up to 2000 characters) and optionally a single reference image via URL, which can be in JPEG, JPG, PNG (without alpha), BMP, or WEBP format.