Google launches Gemini 3 with new coding app and record benchmark scores

Google has launched Gemini 3, its most advanced AI model yet, along with a new agentic coding app called Antigravity. The model achieves record benchmark scores across reasoning, multimodal, and coding tasks.

Google unveiled Gemini 3, its latest and most capable generative AI model, available immediately in the Gemini app and via Google Search’s AI mode.

This model delivers a significant leap in reasoning, multimodal understanding, and tool use. On standard benchmarks, Gemini 3 Pro scored a record 37.4 on “Humanity’s Last Exam” and set new highs on LMArena, WebDev Arena, and agentic coding evaluations.

To support developers, Google also introduced Antigravity, an IDE-like platform where AI agents (powered by Gemini 3) interact directly with code editors, terminals, and browsers to build software autonomously.

The GoML POV

Google’s release of Gemini 3 is a solid leap in multimodal reasoning and agentic coding. But from an enterprise perspective, benchmark wins are only half the story. Models don’t succeed in production because they top HLE or LMArena they succeed when they behave consistently across messy, high-stakes, real-world workloads.

At GoML, across healthcare, finance, and insurance deployments, we’ve learned that enterprises care far more about predictability, governance, latency guarantees, auditability, and cost-efficiency. These remain open questions for Gemini 3. Google’s new coding agent, Antigravity, looks powerful, but its real test is whether it can maintain workflow stability, integrate cleanly with legacy stacks, and operate within enterprise security boundaries.

Gemini 3 is an impressive research milestone, but adoption will depend on how well it performs inside controlled enterprise environments, supports domain-level fine-tuning, and aligns with compliance frameworks like HIPAA, PCI, and SOC2. For GoML, Gemini 3 is a promising entrant in the model ecosystem one that could deliver value once its agentic behavior is validated in production, not just on curated benchmark suites.

Google

Google launches Gemini 3 with new coding app and record benchmark scores

The GoML POV

Read Our Content

GPT Live: OpenAI's new voice model built for real conversation

Deveshi Dabbawala

GoML achieves AWS Healthcare Competency Partnership ahead of launch of upcoming health platform

Siddharth Menon

Accelerate Your AI Adoption

Get an Executive Briefing

HQ

India

Google launches Gemini 3 with new coding app and record benchmark scores

The GoML POV

Read Our Content

GPT Live: OpenAI's new voice model built for real conversation

Deveshi Dabbawala

GoML achieves AWS Healthcare Competency Partnership ahead of launch of upcoming health platform

Siddharth Menon

Accelerate Your AI Adoption

Get an Executive Briefing​

HQ

India​

Get an Executive Briefing

India