Gemini Omni is Google's multimodal video model. It accepts text, images, audio, and existing footage as input and generates short video clips. The model can both create new videos and edit existing ones, producing high-quality output, and it is particularly strong at scene manipulation and visual effects.