Wan2.5-Preview

Wan2.5-Preview AI Text to Video with Native Audio

Commercial use

Each generation costs $0.0946 per second of video at 720p, or $0.1476 per second at 1080p.

720p · 5s: $0.473
1080p · 5s: $0.738
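
The 5-second prices are the per-second rate multiplied by the clip length. A minimal sketch of that arithmetic follows. The helper name `estimate_cost` is hypothetical and not part of any published API. The sketch assumes billing stays linear in duration for other clip lengths, which the page does not confirm.

```python
from decimal import Decimal

# Per-second rates in USD, taken from the pricing line on this page.
RATE_PER_SECOND = {
    "720p": Decimal("0.0946"),
    "1080p": Decimal("0.1476"),
}


def estimate_cost(resolution: str, seconds: int) -> Decimal:
    """Hypothetical helper: rate x duration, assuming linear billing."""
    if resolution not in RATE_PER_SECOND:
        raise ValueError(f"unknown resolution: {resolution!r}")
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return RATE_PER_SECOND[resolution] * seconds


if __name__ == "__main__":
    for resolution in ("720p", "1080p"):
        print(resolution, "5s:", estimate_cost(resolution, 5))
```

Decimal is used instead of float so the products match the listed prices exactly rather than picking up binary rounding error.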
Text To Video · Image To Video · Audio To Video

Wan2.5-Preview is Alibaba’s multimodal video model in research preview. It supports text to video and image to video with native audio generation for clips around 10 seconds. It offers strong prompt adherence, smooth motion, and multilingual audio for narrative scenes.
