Models
May 29, 2026

Google showcases Gemini Omni 3.5 with advanced AI video creation and editing

Google has demonstrated Gemini Omni 3.5, a multimodal AI model that can generate and edit videos using text, images, audio, and video inputs through natural language conversations.

Google has highlighted the capabilities of Gemini Omni 3.5 through a series of video demonstrations showcasing its next-generation multimodal creation features. Gemini Omni enables users to combine text, images, audio, and video inputs to generate high-quality videos grounded in real-world knowledge.

The model also supports conversational video editing, allowing creators to modify scenes, camera angles, lighting, and other elements through natural language instructions.

Google says Gemini Omni represents a major step toward "creating anything from any input," while Gemini 3.5 Flash complements it with advanced reasoning, coding, and agentic workflow capabilities. The technology is being integrated across Gemini, Flow, YouTube Shorts, and other Google AI products.

#
Google

Read Our Content

See All Blogs
AI system implementation

What "Code as Agent Harness" means for AI systems in 2026

Deveshi Dabbawala

June 9, 2026
Read more
AWS

Amazon Cognito now supports multi-Region replication feature

Deveshi Dabbawala

June 8, 2026
Read more