OmniHuman-1

Generate realistic talking human videos from one image

OmniHuman-1 is a ByteDance research model that generates human video from a single image plus a motion signal such as audio. It focuses on accurate lip sync, expressive motion, and strong generalization across portraits, full-body shots, cartoons, and stylized avatars.

Commercial use
Image To Video, Audio To Video