Wan 2.6 AI Video Generator

Experience Alibaba's Wan 2.6 on Cuty.ai — featuring Reference-to-Video generation that lets you star in AI videos. Create up to 15-second clips with multi-shot storytelling, native audio-visual sync & cinematic 1080P quality. Try it free!

Key Features

Discover what makes Wan 2.6 exceptional

Reference-to-Video: Star in Your Own AI Videos

Wan 2.6 introduces China's first Reference-to-Video (R2V) generation. Upload a character reference video containing appearance and voice, then generate entirely new scenes starring that character — preserving their distinctive look and sound across different scenarios.

Extended 15-Second Videos with Multi-Shot Storytelling

Create videos up to 15 seconds long, 50% longer than the previous 10-second limit. The model builds multi-shot narratives from panoramic, close-up, and tracking shots joined by smooth transitions, and automatically converts prompts into professional storyboards.

Native Audio-Visual Synchronization

Wan 2.6 delivers native audio-visual (AV) sync, so visuals match vocals, sound effects, and background music. Audio-driven modes are also supported, which means your characters' movements and expressions align naturally with the generated soundscape.

Cinematic 1080P Quality & Style Control

Generate cinematic 1080P videos with realistic portrait textures and natural lighting. Wan 2.6 offers improved instruction-following precision, advanced logical reasoning, and superior artistic style control for professional-grade visual output.

Frequently Asked Questions

Everything you need to know about Wan 2.6

Wan 2.6 is Alibaba's latest AI video generation series, unveiled in December 2025. It includes upgrades across five models: R2V (Reference-to-Video), T2V (Text-to-Video), I2V (Image-to-Video), Wan2.6-image, and Wan2.6-T2I, representing a comprehensive evolution of visual generation capabilities.

R2V is China's first reference-to-video generation model. You upload a character reference video containing both appearance and voice, and the model generates new scenes starring that character while preserving their distinctive look and sound — essentially letting you star in AI-generated videos.

Wan 2.6 supports videos up to 15 seconds in length, 50% longer than the previous 10-second limit. This extended duration enables complete narrative arcs suitable for dramas, advertisements, and storytelling content.

Yes. Wan 2.6 builds multi-shot narratives from panoramic, close-up, and tracking shots joined by smooth transitions. The model can automatically convert text prompts into professional storyboards with multiple camera angles.

Wan 2.6 provides native audio-visual synchronization, ensuring visuals perfectly match vocals, sound effects, and background music. It also supports audio-driven modes where character movements and expressions align naturally with the audio content.

Wan 2.6 is available on Cuty.ai, where you can use its text-to-video and image-to-video capabilities directly. The model is also accessible through Alibaba Cloud's Model Studio and the official Wan website.

You can try Wan 2.6 and its Reference-to-Video capabilities on Cuty.ai with our free trial credits. For extended usage and access to all premium features, we offer various subscription plans.

Ready to create with Wan 2.6?

Start generating amazing content with our powerful AI models. Try it free today!