Wan2.2 5B
Hybrid text-and-image-to-video model for smooth 720p clips

Wan2.2 5B is a compact hybrid text-and-image-to-video model that targets 720p output at 24 fps with strong motion coherence. It accepts either a text-only prompt or a text prompt guided by an input image. It is optimized for fast inference on consumer GPUs and fits into production video workflows.
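
As a rough illustration of the two input modes and the 24 fps target, the sketch below models a generation request and works out how many frames a clip of a given length would need. The `ClipRequest` class, its field names, and the 5-second default are hypothetical and exist only for this example; they are not part of any Wan2.2 API.

```python
from dataclasses import dataclass
from typing import Optional

FPS = 24  # target frame rate stated in the description above


@dataclass
class ClipRequest:
    """Hypothetical request shape for illustration; not an actual Wan2.2 API."""

    prompt: str
    image_path: Optional[str] = None  # set this for image-guided generation
    seconds: float = 5.0              # assumed clip length, chosen for the example

    @property
    def mode(self) -> str:
        # An input image switches the request from text-only to image-guided.
        return "image-guided" if self.image_path else "text-only"

    @property
    def num_frames(self) -> int:
        # Frames needed to cover the clip duration at the target frame rate.
        return round(self.seconds * FPS)


text_request = ClipRequest(prompt="a paper boat drifting down a rainy street")
image_request = ClipRequest(prompt="the boat starts to sail", image_path="boat.png")

print(text_request.mode, text_request.num_frames)
print(image_request.mode, image_request.num_frames)
```

At 24 fps, every second of output adds 24 frames, so clip length is the main factor in how much work a single request involves.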