Vidu 1.5

Vidu 1.5: multi-entity text-to-video with stable scenes

72c013c5-ccbc-41f2-ac99-305787ecd156
Commercial use
Text To Video, Image To Video, Reference To Video

Vidu 1.5 is a multimodal text-to-video model focused on multi-entity consistency in complex scenes: it keeps multiple characters and objects visually stable across frames and shots. Developers can use it to build long-form video workflows that need coherent motion and style control.
