OSS-CRS: Liberating AIxCC Cyber Reasoning Systems for Real-World Open-Source Security

This paper introduces OSS-CRS, an open-source, locally deployable framework that liberates DARPA's AIxCC cyber reasoning systems from obsolete competition infrastructure, enabling their practical application to discover and patch vulnerabilities in real-world open-source projects, as demonstrated by the successful porting of the first-place Atlantis system to find 10 new bugs.

Andrew Chin, Dongkwan Kim, Yu-Fu Fu, Fabian Fleischer, Youngjoon Kim, HyungSeok Han, Cen Zhang, Brian Junekyu Lee, Hanqing Zhao, Taesoo Kim

Published 2026-03-10
📖 5 min read🧠 Deep dive

Imagine a high-stakes cooking competition where seven teams of brilliant chefs were given a challenge: not just to find a rotten ingredient in a giant, complex kitchen, but to fix it, cook a new dish, and prove it's safe to eat, all without human help.

They succeeded! These "Cyber Reasoning Systems" (CRSs) found bugs in software and wrote the code to patch them. But here's the catch: when the competition ended, the chefs packed up their kitchens. They left behind their recipes, but the kitchens themselves were built on a special, temporary cloud infrastructure that no longer exists. If you tried to use their recipes today, they wouldn't work because they were tied to a specific stove, a specific set of ingredients, and a specific cloud that was turned off.

Enter OSS-CRS: The Universal Kitchen.

This paper introduces OSS-CRS, a new, open-source framework that acts like a universal adapter and a shared kitchen. It takes those competition-winning "recipes" (the AI systems) and lets anyone run them in their own local kitchen to fix real-world software problems.

Here is how it works, broken down into simple concepts:

1. The Problem: The "Locked-in" Chefs

The authors looked at the seven winning teams from the competition and found three big reasons why their systems were useless to the rest of the world:

  • Duplication: Every team built their own version of the same basic tools (like their own stoves, sinks, and ingredient trackers) from scratch. It was like seven teams each building their own refrigerator instead of sharing one.
  • Cloud Lock-in: The systems were designed to run on a specific, temporary cloud server (like Azure) that was shut down after the contest. Trying to run them now is like trying to drive a car that only works on a track that has been demolished.
  • Monolithic Design: The systems were built as giant, single blocks. You couldn't take Team A's amazing "bug-finding robot" and Team B's "patch-writing robot" and combine them. They were stuck together like a solid brick; you couldn't swap out the parts.

2. The Solution: The "Universal Adapter" (OSS-CRS)

The authors built OSS-CRS, which acts as a universal power strip and a shared workspace.

  • The Interface (The Power Strip): Instead of needing a specific plug for every system, OSS-CRS gives every AI system a standard plug. Now, any system can plug in and start working immediately.
  • The Budget Manager (The Wallet): Using AI (Large Language Models) to fix bugs costs money (like buying ingredients). The competition gave teams a $50,000 budget. OSS-CRS has a built-in "wallet" that tracks how much money each AI spends. If an AI tries to spend too much, the system cuts it off, ensuring you don't go bankrupt while testing.
  • The Exchange Table (The Potluck): This is the coolest part. Imagine a potluck dinner.
    • Team A's Fuzzer (a robot that tries to break the software) finds a broken plate.
    • Instead of keeping it, it puts the broken plate on a central table.
    • Team B's Patcher (a robot that fixes things) sees the broken plate on the table, picks it up, and fixes it.
    • They don't need to talk to each other directly; they just share items on the table. This allows different teams' technologies to work together seamlessly.

3. The Result: Real-World Success

To prove it works, the authors took the first-place winner from the competition (a system called ATLANTIS) and "ported" it to this new universal kitchen.

  • The Setup: They ran it on a single computer (no giant cloud needed).
  • The Test: They pointed it at 8 real-world open-source projects (like libraries used by millions of people).
  • The Harvest: In just 24 hours, the system found 10 brand-new bugs (including 3 that were very dangerous) that no one knew about before. It also wrote the code to fix them.

Why This Matters

Before this, if you wanted to use these super-smart AI bug-finders, you had to be part of the specific team that built them or have access to a massive, expensive cloud setup.

OSS-CRS changes the game by:

  1. Democratizing Security: Anyone can now run these advanced AI systems to protect their software.
  2. Encouraging Collaboration: Researchers can now mix and match the best parts of different systems (like taking the best fuzzer from one team and the best patcher from another) to build even stronger defenses.
  3. Saving Money: It manages costs so you don't accidentally spend a fortune on AI credits while testing.

In short: The authors took the "secret sauce" from a high-tech cooking competition, built a universal kitchen where anyone can cook with it, and proved that it can find and fix real-world problems today. It turns a closed, expensive competition into an open, shared tool for keeping the internet safe.