Google has highlighted the capabilities of Gemini Omni 3.5 through a series of video demonstrations showcasing its next-generation multimodal creation features. Gemini Omni enables users to combine text, images, audio, and video inputs to generate high-quality videos grounded in real-world knowledge.
The model also supports conversational video editing, allowing creators to modify scenes, camera angles, lighting, and other elements through natural language instructions.
Google says Gemini Omni represents a major step toward "creating anything from any input," while Gemini 3.5 Flash complements it with advanced reasoning, coding, and agentic workflow capabilities. The technology is being integrated across Gemini, Flow, YouTube Shorts, and other Google AI products.




