Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking
The paper introduces Jailbreak Foundry (JBF), a multi-agent system that automatically translates jailbreak research papers into executable modules within a unified harness. This enables fast, reproducible, and standardized benchmarking of large language model security as attack techniques evolve.
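To make the idea of a "unified harness" concrete, the following is a minimal sketch, not JBF's actual design. It assumes that each translated module exposes a common interface and that the harness scores every module against the same target and judge. All names here (`AttackModule`, `IdentityBaseline`, `UppercaseBaseline`, `run_benchmark`, `stub_target`, `stub_judge`) are hypothetical, and the two modules are harmless placeholder transformations rather than real attacks.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Protocol


class AttackModule(Protocol):
    """Hypothetical shared interface; the paper's real interface may differ."""
    name: str

    def transform(self, prompt: str) -> str: ...


@dataclass
class IdentityBaseline:
    """Placeholder module: passes the prompt through unchanged."""
    name: str = "identity"

    def transform(self, prompt: str) -> str:
        return prompt


@dataclass
class UppercaseBaseline:
    """Placeholder module: upper-cases the prompt (a harmless stand-in)."""
    name: str = "uppercase"

    def transform(self, prompt: str) -> str:
        return prompt.upper()


def run_benchmark(
    modules: List[AttackModule],
    prompts: List[str],
    target: Callable[[str], str],
    judge: Callable[[str], bool],
) -> Dict[str, float]:
    """Run every module on the same prompts, target, and judge.

    Returns the fraction of prompts the judge marks as successful,
    keyed by module name. Sharing target and judge across modules is
    what makes the per-module numbers comparable.
    """
    results: Dict[str, float] = {}
    for module in modules:
        hits = sum(judge(target(module.transform(p))) for p in prompts)
        results[module.name] = hits / len(prompts) if prompts else 0.0
    return results


def stub_target(prompt: str) -> str:
    # Stand-in for a model call (assumption): "refuses" all-caps input.
    return "REFUSED" if prompt.isupper() else f"echo: {prompt}"


def stub_judge(response: str) -> bool:
    # Stand-in judge (assumption): success means the target did not refuse.
    return response != "REFUSED"


if __name__ == "__main__":
    scores = run_benchmark(
        [IdentityBaseline(), UppercaseBaseline()],
        ["hello", "test prompt"],
        stub_target,
        stub_judge,
    )
    print(scores)
```

Because every module is evaluated through the same `run_benchmark` loop, adding a newly translated technique in this sketch only requires a class with a `name` and a `transform` method; the target, judge, and scoring stay fixed.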