OmniHuman-1
Generate realistic talking human videos from one image

OmniHuman-1 is a ByteDance research model that generates human video from a single image plus a motion signal such as audio. It focuses on accurate lip sync and expressive motion, and it generalizes across portraits, full-body shots, cartoons, and stylized avatars.