Google showcases Gemini Omni 3.5 with advanced AI video creation and editing

Google has demonstrated Gemini Omni 3.5, a multimodal AI model that can generate and edit videos using text, images, audio, and video inputs through natural language conversations.

In a series of video demonstrations, Google highlighted Gemini Omni 3.5's next-generation multimodal creation features. Gemini Omni lets users combine text, images, audio, and video inputs to generate high-quality videos grounded in real-world knowledge.

The model also supports conversational video editing, allowing creators to modify scenes, camera angles, lighting, and other elements through natural language instructions.

Google says Gemini Omni represents a major step toward "creating anything from any input," while Gemini 3.5 Flash complements it with advanced reasoning, coding, and agentic workflow capabilities. The technology is being integrated across Gemini, Flow, YouTube Shorts, and other Google AI products.

Google