Wan 2.1

Advanced Open-Source Video Generation with State-of-the-Art Performance

Gallery of Wan 2.1 Creations

Be inspired by the results achieved with Wan 2.1

What is Wan 2.1?

Open and Advanced Large-Scale Video Generation Model

Wan 2.1 is a comprehensive suite of open video foundation models. Built on a novel 3D causal VAE architecture and an advanced diffusion transformer, it delivers state-of-the-art performance while its smaller 1.3B model remains usable on consumer-grade GPUs.

  • State-of-the-Art Performance: Outperforms existing open-source models and commercial solutions across multiple benchmarks
  • Consumer GPU Compatible: The 1.3B model needs just 8.19 GB of VRAM and runs on consumer GPUs such as the RTX 4090
  • Multiple Tasks: Text-to-Video, Image-to-Video, and more
  • Visual Text Generation: First video model able to generate both Chinese and English text in videos

Getting Started with Wan 2.1

Quick Guide to Using Our AI Platform

  1. Choose your task: Text-to-Video (14B/1.3B) or Image-to-Video (14B)
  2. Select resolution: 480P or 720P (the 14B model handles both; the 1.3B model is optimized for 480P)
  3. Enter your prompt or upload an image
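
The three steps above can be sketched as a small validation helper. This is a minimal Python sketch, not the platform's actual API: the names RECOMMENDED_RESOLUTIONS, validate_choice, and build_request are hypothetical, and the table only encodes the task, model, and resolution options listed on this page.

```python
# Hypothetical table of the options listed on this page.
# Keys are (task, model size); values are the resolutions suggested for them.
RECOMMENDED_RESOLUTIONS = {
    ("text-to-video", "14B"): {"480P", "720P"},
    ("text-to-video", "1.3B"): {"480P"},  # the 1.3B model is optimized for 480P
    ("image-to-video", "14B"): {"480P", "720P"},
}


def validate_choice(task, model, resolution):
    """Return True if the combination appears in the table above."""
    return resolution in RECOMMENDED_RESOLUTIONS.get((task, model), set())


def build_request(task, model, resolution, prompt=None, image_path=None):
    """Assemble a plain dict describing one generation job (illustrative only)."""
    if not validate_choice(task, model, resolution):
        raise ValueError(f"unsupported combination: {task}, {model}, {resolution}")
    if task == "text-to-video" and not prompt:
        raise ValueError("text-to-video needs a prompt")
    if task == "image-to-video" and not image_path:
        raise ValueError("image-to-video needs an input image")
    return {
        "task": task,
        "model": model,
        "resolution": resolution,
        "prompt": prompt,
        "image_path": image_path,
    }


request = build_request("text-to-video", "1.3B", "480P", prompt="A cat surfing at sunset")
print(request["resolution"])  # 480P
```

Checking the combination before submitting a job avoids asking the 1.3B model for a resolution it is not tuned for.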

Wan 2.1 Key Features

Advanced Capabilities for Video Generation

Powerful Video VAE

Encode and decode 1080P videos of any length while preserving temporal information

Multiple Resolution Support

Generate videos in both 480P and 720P with exceptional quality

Apache 2.0 Licensed

Open-source platform with clear usage rights and community support

Resource Efficient

Generate a 5-second 480P video in about 4 minutes on a consumer GPU such as the RTX 4090
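
To put that figure in planning terms, the short Python sketch below converts it into compute time per second of video and clips per hour. The function names are hypothetical, and the numbers are only the 5-second and 4-minute values quoted above; real timings will vary with hardware and settings.

```python
def compute_seconds_per_video_second(video_seconds, wall_clock_minutes):
    """Seconds of GPU time spent for each second of generated video."""
    return wall_clock_minutes * 60 / video_seconds


def clips_per_hour(wall_clock_minutes):
    """How many clips one GPU finishes in an hour, run back to back."""
    return 60 / wall_clock_minutes


print(compute_seconds_per_video_second(5, 4))  # 48.0
print(clips_per_hour(4))  # 15.0
```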

Frequently Asked Questions

What makes Wan 2.1 different from other video AI models?

Wan 2.1 combines state-of-the-art performance with consumer GPU compatibility. Its 1.3B model runs in just 8.19 GB of VRAM, and the suite outperforms both open-source and commercial solutions across multiple benchmarks.

What video resolutions does Wan 2.1 support?

Wan 2.1 supports both 480P and 720P video generation. The 14B model handles both resolutions, while the efficient 1.3B model is optimized for 480P.

Is Wan 2.1 suitable for professional use?

Yes. The 14B model delivers the highest quality for demanding work, while the 1.3B model keeps hardware requirements low for smaller projects. The Apache 2.0 license also permits commercial use.

What makes Wan 2.1's architecture special?

Wan 2.1 features a novel 3D causal VAE architecture and advanced diffusion transformer, enabling superior video generation while maintaining efficiency.
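
As a rough illustration of why a 3D causal VAE helps efficiency, the Python sketch below computes the size of the compressed latent grid that the diffusion transformer would operate on. The compression factors (4x in time, 8x in each spatial dimension), the 81-frame clip, and the 832x480 frame size are illustrative assumptions that this page does not state, and latent_shape is a hypothetical helper rather than part of any Wan 2.1 API.

```python
def latent_shape(frames, height, width, t_stride=4, s_stride=8):
    """Latent (frames, height, width) for a causal video VAE.

    Assumes the first frame is encoded on its own and every following
    group of `t_stride` frames collapses into one latent frame.
    The stride values are assumptions made for illustration.
    """
    if (frames - 1) % t_stride != 0:
        raise ValueError("frames must equal 1 + k * t_stride")
    if height % s_stride or width % s_stride:
        raise ValueError("height and width must be multiples of s_stride")
    return (1 + (frames - 1) // t_stride, height // s_stride, width // s_stride)


print(latent_shape(81, 480, 832))  # (21, 60, 104)
```

Under these assumptions the transformer sees 21 latent frames of 60 by 104 positions instead of 81 full-resolution frames, which is where most of the memory and compute savings would come from.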

Can Wan 2.1 handle different languages?

Yes. Wan 2.1 is the first video model capable of generating both Chinese and English text within its videos.