Gemini Omni Flash

Gemini Omni Flash is Google's multimodal video generation and editing model in the Gemini Omni family. It turns text, photos, and video into 10-second clips with native audio generation, supports photo-to-video creation from up to five reference images, and adds video-to-video plus multi-turn editing workflows. Google positions it as the Gemini app successor to Veo 3.1, combining Gemini's world understanding with conversational control for video creation and editing.

Complete technical specification for integration
Step-by-step tutorials for advanced use cases
Cinematic prompting for Gemini Omni Flash How to prompt Gemini Omni Flash for cinematic video using Google's five-element structure, camera language, and the less-prescriptive sweet spot.
Editing video with Gemini Omni Flash How to edit existing footage with Gemini Omni Flash's inputs.video parameter to relight, restyle, swap weather, or add characters while preserving the source's composition and motion.
Reference-driven video with Gemini Omni Flash How to use Gemini Omni Flash's reference image workflow to lock a visual style, hold a character across scenes, or guide a video through storyboard key beats.