Ecosystem
December 9, 2025

AWS introduces Nova 2 Omni, their A2A model

Nova 2 Omni is AWS's industry-first multimodal model processing text, image, video, and audio inputs with unified text/image outputs, enabling agents to reason across diverse media like keynote summaries with visuals.

Nova 2 Omni stands as the multimodal pinnacle of the Nova 2 lineup, ingesting text, images, videos, and audio while generating text or image responses from a single model architecture.

It unifies reasoning over mixed modalities for tasks such as analyzing presentations with slides, extracting insights from multimedia content, or powering agents that interpret visual and auditory context alongside text.

By handling diverse inputs natively, Omni simplifies development of cross-media AI applications, reduces model orchestration complexity, and supports richer enterprise use cases like content summarization or interactive visual analysis.

#
AWS

Read Our Content

See All Blogs
Whitepaper

Whitepaper on AI Matic’s Intelligent Document Processing

Akash Chandrasekar

May 13, 2026
Read more
AWS

How we cut a 3-hour AWS observability investigation down to 11 minutes

Sarankumar S

May 12, 2026
Read more