OmniHuman-1

Generate realistic talking human videos from one image

OmniHuman-1

OmniHuman-1 is a ByteDance research model for human video generation from a single image and motion signals like audio. It focuses on accurate lip sync, expressive motion, and strong generalization across portraits, full body shots, cartoons, and stylized avatars.

Commercial use
image-to-videoaudio-to-video