Clear, Compelling Arguments: Rethinking the Foundations of Frontier AI Safety Cases
This paper critiques the current limitations of alignment-focused safety cases for frontier AI by drawing on established methodologies from safety-critical industries to propose a more robust, defensible framework, illustrated through a case study on deceptive alignment and CBRN capabilities.