Anthropic’s Responsible Scaling Policy: Version 3.0

Anthropic has released Version 3.0 of its Responsible Scaling Policy, a risk governance framework. The update revises how the company assesses and mitigates AI risks, expands transparency through risk reports and safety roadmaps, and adapts safeguards as model capabilities grow.

Version 3.0 updates Anthropic’s voluntary framework for managing risks from advanced AI systems. The policy explains how safeguards should scale with increasing capabilities through “if-then” commitments: if a model reaches a defined capability threshold, then the corresponding safeguards are expected to apply.

The update introduces transparency measures, including Frontier Safety Roadmaps and periodic Risk Reports, intended to show how risks and mitigations align. It also separates Anthropic’s internal plans from its broader recommendations for the industry, and aims to reinforce the successful elements of earlier versions while improving accountability.

Anthropic says the policy will evolve as AI advances, balancing practical safeguards with the need to address emerging threats and encourage broader industry risk governance.

Anthropic