Anthropic’s Responsible Scaling Policy Version 3.0 updates the company’s voluntary framework for managing risks from advanced AI systems. The policy explains how safeguards should scale with increasing capabilities, using “if-then” commitments: if a model reaches a specified capability threshold, then correspondingly stronger safeguards apply.
It introduces transparency measures, such as Frontier Safety Roadmaps and periodic Risk Reports, to show how mitigations line up with the risks they address. The update separates Anthropic’s own internal plans from its recommendations for the broader industry, and it aims to retain the elements of earlier versions that worked while improving accountability.
Anthropic says the policy will evolve as AI advances, balancing practical safeguards with the need to address emerging threats and encourage broader industry risk governance.


