Vidu Q3

Multimodal video generation with native audio and intelligent shot planning

Vidu Q3 is a multimodal video generation model that creates video with synchronized audio directly from text or images, supports intelligent multi-shot sequencing, and produces complete outputs with stable visuals and embedded subtitles without post-processing.

Commercial use
360p · 1s · $0.0455
540p · 1s · $0.0455
720p · 1s · $0.0975
1080p · 1s · $0.1040
text-to-video · image-to-video · audio-to-video
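
As a rough illustration of how the per-second prices listed above translate into a clip cost, here is a minimal sketch. It assumes billing scales linearly with clip duration (the listing only gives a 1s rate) and that prices are in US dollars; `PRICE_PER_SECOND` and `estimate_cost` are hypothetical names for this example, not part of any Vidu API.

```python
from decimal import Decimal

# Per-second prices copied from the listing above (assumed to be USD).
PRICE_PER_SECOND = {
    "360p": Decimal("0.0455"),
    "540p": Decimal("0.0455"),
    "720p": Decimal("0.0975"),
    "1080p": Decimal("0.1040"),
}


def estimate_cost(resolution: str, seconds: int) -> Decimal:
    """Hypothetical helper: estimate clip cost, assuming linear per-second billing."""
    if resolution not in PRICE_PER_SECOND:
        raise ValueError(f"unknown resolution: {resolution!r}")
    if seconds <= 0:
        raise ValueError("seconds must be a positive integer")
    # Decimal avoids binary floating-point rounding on currency amounts.
    return PRICE_PER_SECOND[resolution] * seconds


if __name__ == "__main__":
    for res in PRICE_PER_SECOND:
        print(f"{res}: 8s clip costs about ${estimate_cost(res, 8)}")
```

Under these assumptions, an 8-second clip at 720p would come to 8 × $0.0975 = $0.78. Actual billing may round durations or apply minimums, so treat the result as an estimate only.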