
VEED Fabric 1.0
Talking video generation from image and audio inputs
Image to VideoAudio to Video
VEED Fabric 1.0
Talking video generation from image and audio inputs
Image to VideoAudio to Video
VEED Fabric 1.0 Overview
VEED Fabric 1.0 is a multimodal AI model that generates talking videos by animating a static image with synchronized speech and expressive motion. Given a single image and an audio input (either voice recording or text-to-speech), the model produces a short video where the subject’s facial expressions, lip movements, head gestures, and body motion align with the provided audio. It supports diverse input image styles and preserves the appearance of the source visual while delivering natural speech synchronization.