Models
June 24, 2026

OpenAI and Broadcom unveil Jalapeño AI inference chip

OpenAI and Broadcom have introduced Jalapeño, a custom AI inference chip designed to improve performance, lower costs, and reduce reliance on third-party hardware for large-scale AI deployments.

OpenAI and Broadcom have announced Jalapeño, OpenAI's first custom AI inference chip built specifically for running large language models efficiently at scale. Designed for inference rather than model training, the chip will initially power workloads such as Codex and other customer-facing AI services.

OpenAI says Jalapeño is the first generation of a broader custom silicon roadmap aimed at improving performance, reducing operational costs, and decreasing dependence on NVIDIA hardware. Broadcom contributed its chip design expertise, while OpenAI provided insights from its AI research and infrastructure needs.

Deployment is expected to begin later this year.

#
OpenAI

Read Our Content

See All Blogs
Gen AI

Sakana AI Fugu enables one API for smarter routing and better production AI architecture

Sarankumar S

June 23, 2026
Read more
Gen AI

Plumbata saves 95% review time using AI contract management software

Deveshi Dabbawala

June 23, 2026
Read more