Ovi
Ovi is a unified audio video diffusion model that treats sound and visuals as one generative process. It uses twin DiT backbones with blockwise cross modal fusion to create synchronized speech, effects, and motion from text prompts or text plus image inputs in a single pass.
API Reference
INTEGRATE
Complete technical specification for integration
Request Response
Examples 4
CODE
Ready-to-use code snippets for common workflows
Rain-Soaked Arcade Concourse
{
"taskType": "videoInference",
"taskUUID": "fd9a5ac6-7acc-44ef-a961-cf55b99e41cd",
"model": "runware:190@1",
"positivePrompt": "Animate this rainy retro-futurist arcade concourse into a cohesive audiovisual sequence. The camera drifts forward at walking pace through the passageway as puddles ripple under falling droplets from ceiling leaks. Game cabinet screens pulse and flicker in staggered rhythms, vending machines hum softly, and reflected colors slide across the wet floor. A tram glides past in the background beyond the arches, sending a low metallic rumble through the space. People in translucent raincoats cross frame naturally, some pausing at machines, others hurrying through, with believable footfalls, fabric rustle, distant chatter, and reverberant ambience. Occasional thunder rolls outside and the electrical buzz subtly swells as lights blink overhead. Preserve the composition of the source image while adding cinematic motion, synchronized environmental sound, layered depth, and realistic timing.",
"fps": 24,
"seed": 71241,
"steps": 30,
"inputs": {
"image": "https://assets.runware.ai/assets/inputs/839ea199-a462-46b8-90ce-c703cac214f3.jpg"
}
}{
"taskType": "videoInference",
"taskUUID": "fd9a5ac6-7acc-44ef-a961-cf55b99e41cd",
"videoUUID": "4585f688-3f56-4cdf-9750-de1343c9d355",
"videoURL": "https://vm.runware.ai/video/os/a10dlim3/ws/5/vi/4585f688-3f56-4cdf-9750-de1343c9d355.mp4",
"seed": 71241,
"cost": 0.1171
}Stormy Lighthouse Rescue Approach
{
"taskType": "videoInference",
"taskUUID": "6537bd8a-b38e-4579-9b6f-7f48cd2f4f38",
"model": "runware:190@1",
"positivePrompt": "Animate this storm-battered lighthouse scene into a high-tension cinematic rescue sequence. The helicopter advances through heavy rain and crosswinds, rotor blades slicing mist into spiraling sheets. The lighthouse beacon sweeps across the sea and briefly catches the aircraft fuselage. Waves hammer the rocks in irregular bursts, white spray rising with each impact. Lightning flashes intermittently, revealing texture in the clouds and ocean surface. Camera feel: slow forward drift with subtle handheld turbulence, as if filmed from another aircraft nearby. Motion should feel physically grounded and coherent: rain driven sideways by wind, foam dragged back into the surf, helicopter navigation lights pulsing through the storm, searchlight glancing across wet stone. Audio should emerge naturally from the scene with synchronized thunder cracks, rotor thrum, wind gusts, rain striking metal, distant warning siren from the lighthouse, and booming surf. Moody, realistic, suspenseful, immersive.",
"fps": 24,
"seed": 79445,
"steps": 40,
"inputs": {
"image": "https://assets.runware.ai/assets/inputs/597fde48-a2bd-4a5b-a57d-2e8673b348e1.jpg"
}
}{
"taskType": "videoInference",
"taskUUID": "6537bd8a-b38e-4579-9b6f-7f48cd2f4f38",
"videoUUID": "885007e4-58d3-42e8-9bca-5fa150e75eb7",
"videoURL": "https://vm.runware.ai/video/os/a05d22/ws/5/vi/885007e4-58d3-42e8-9bca-5fa150e75eb7.mp4",
"seed": 79445,
"cost": 0.1472
}Glacial Freight Terminal Dawn
{
"taskType": "videoInference",
"taskUUID": "88fd8e24-b83e-45fd-b606-63f6d3cf3093",
"model": "runware:190@1",
"positivePrompt": "A cinematic aerial drift through a remote polar freight terminal at dawn, starting from a wide establishing view and gliding toward docked cargo airships as ground crews move between steel loading gantries. Fine snow streams across the frozen tarmac, turbine fans spool up with visible exhaust haze, suspended cables sway lightly in the wind, warning beacons blink in rhythmic intervals, and distant cargo containers shift on automated sleds. Emphasize realistic environmental motion, crisp cold atmosphere, volumetric breath from workers, subtle camera parallax, and synchronized industrial soundscape with gusting wind, engine rumble, metallic clanks, radio chatter, hydraulic whines, and echoing announcements. Naturalistic, high-detail, grounded cinematic realism.",
"fps": 24,
"seed": 14447,
"steps": 40,
"inputs": {
"image": "https://assets.runware.ai/assets/inputs/da99c2f7-1bb5-466a-a313-bd44026f790d.jpg"
}
}{
"taskType": "videoInference",
"taskUUID": "88fd8e24-b83e-45fd-b606-63f6d3cf3093",
"videoUUID": "4344d183-7c03-4256-9ab7-d5af1cc20627",
"videoURL": "https://vm.runware.ai/video/os/a05d22/ws/5/vi/4344d183-7c03-4256-9ab7-d5af1cc20627.mp4",
"seed": 14447,
"cost": 0.1459
}Windblown Grassland Radio Relay
{
"taskType": "videoInference",
"taskUUID": "c4bdc7a1-ae63-476b-818c-d89d64111c00",
"model": "runware:190@1",
"positivePrompt": "Animate this still image into a grounded cinematic scene: strong wind ripples through tall grass in waves, guy-wires tremble subtly, loose tarp edge flickers, the service truck suspension shifts slightly in gusts, cloud shadows drift across the field, a flock of small birds lifts from the grass in the distance, the antenna beacon blinks faintly, synchronized natural audio with rushing wind, soft metal creaks, distant turbine-like electrical hum, intermittent radio static and clipped voice fragments from the relay shed, realistic motion, coherent environmental sound design, atmospheric documentary style",
"fps": 24,
"seed": 99804,
"steps": 40,
"inputs": {
"image": "https://assets.runware.ai/assets/inputs/42361c12-04bc-489d-8036-61b72bfa935a.jpg"
}
}{
"taskType": "videoInference",
"taskUUID": "c4bdc7a1-ae63-476b-818c-d89d64111c00",
"videoUUID": "e1373a2c-ce29-425a-8413-64149a954294",
"videoURL": "https://vm.runware.ai/video/os/a03d21/ws/5/vi/e1373a2c-ce29-425a-8413-64149a954294.mp4",
"seed": 99804,
"cost": 0.1466
}