OpenAI and Broadcom have announced Jalapeño, OpenAI's first custom AI inference chip built specifically for running large language models efficiently at scale. Designed for inference rather than model training, the chip will initially power workloads such as Codex and other customer-facing AI services.
OpenAI says Jalapeño is the first generation of a broader custom silicon roadmap aimed at improving performance, reducing operational costs, and decreasing dependence on NVIDIA hardware. Broadcom contributed its chip design expertise, while OpenAI provided insights from its AI research and infrastructure needs.
Deployment is expected to begin later this year.




