SoK: AI-Augmented Binary Reversing

Original authors: Yujeong Kwon, Yiyue Zhang, Shakhzod Yuldoshkhujaev, Kexin Pei, Dokyung Song, Hyungjoon Koo

Published 2026-06-17

📖 5 min read🧠 Deep dive

Original authors: Yujeong Kwon, Yiyue Zhang, Shakhzod Yuldoshkhujaev, Kexin Pei, Dokyung Song, Hyungjoon Koo

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). ✨ This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine you have a delicious, complex cake, but all you have left is the empty, crumpled wrapper and a few crumbs. You know the cake was made with flour, eggs, and sugar, but you don't know the recipe, the brand of the ingredients, or who baked it. Binary reversing is the art of trying to figure out the original recipe just by looking at that wrapper and those crumbs.

This is a huge challenge for computer security experts. When software is built (compiled), the "secret sauce" (the original source code, variable names, and logic) is thrown away, leaving only a machine-readable mess. Trying to understand this mess is like trying to read a book written in a language you don't speak, where the words have been scrambled and the page numbers are missing.

For a long time, humans had to do this manually, which is slow and exhausting. Recently, Artificial Intelligence (AI) has stepped in to help, acting like a super-powered detective. However, because so many different researchers are using AI in different ways, the field has become a bit chaotic and confusing.

This paper is a Systematization of Knowledge (SoK). Think of it as a massive, organized map that brings order to this chaos. The authors looked at 144 research papers published since 2015 to create a unified guide.

Here is the simple breakdown of what they found:

1. The Two-Step Dance (The Pipeline)

The paper explains that AI doesn't just "read" the binary file directly. It's a two-step process:

Step 1: The Human/Tool Detective (Conventional Pipeline): First, traditional tools (like disassemblers) take the messy binary file and turn it into "clues" or artifacts. These clues could be a list of instructions, a map of how the code jumps around (graphs), or a list of numbers.
Step 2: The AI Detective (AI-Augmented Pipeline): The AI then looks at these clues. It doesn't see the raw binary; it sees the clues the tools prepared. It learns patterns from these clues to guess what the original code was doing.

The Analogy: Imagine a crime scene. The police (traditional tools) collect fingerprints, DNA, and photos (artifacts). The AI (the detective) then analyzes those photos and DNA samples to solve the case. The AI isn't at the crime scene itself; it's analyzing the evidence the police gathered.

2. The 22 Different "Cases"

The authors organized all the AI research into 22 different types of "cases" (domains) that AI is trying to solve. They range from simple to very complex:

The Basics: Finding where one function ends and another begins (like finding where one paragraph ends and the next starts).
The Detective Work: Identifying if a piece of code is malware (a virus) or if it was written by a specific hacker group.
The Translation: Trying to guess the original names of variables (like guessing that a variable named x123 was actually called user_password).
The Reconstruction: Trying to rewrite the code back into a human-readable language (decompilation).

3. The Big Problems (Validity Risks)

Even though AI is getting better, the paper warns that there are some serious traps, like a detective who is too confident but actually wrong.

The "Echo Chamber" Problem: Many AI models are trained on the same few types of software (like common Linux tools). If you show them a brand-new type of software they've never seen, they might fail. It's like a student who memorized the answers to last year's test but can't solve a new problem.
The "Garbage In, Garbage Out" Problem: If the traditional tools make a mistake when creating the "clues" (artifacts), the AI will learn from that mistake. If the map is wrong, the AI will get lost.
The "Magic Trick" Problem: Sometimes AI seems to work great, but it's actually just memorizing patterns in the data rather than truly understanding the code. It's like a parrot repeating words it doesn't understand.

4. The Future: From Assistant to Partner

The paper suggests that AI shouldn't just be a tool that gives a single answer. The future is Agentic AI.

Current State: You ask the AI, "Is this code a virus?" and it says "Yes."
Future State: You tell the AI, "I need to understand how this malware steals data." The AI then acts like a partner: it plans a strategy, uses different tools to gather evidence, checks its own work, and says, "I think it steals data because of this specific line, but I'm not 100% sure, so here is the evidence I found."

Summary

This paper is a roadmap for the next decade of AI in cybersecurity. It tells us:

We have a lot of data: We've mapped out 22 different ways AI is being used to reverse-engineer software.
We need better maps: The field is messy, and we need a common language to talk about it.
We need to be careful: AI is powerful, but it can be fooled by bad data, limited training, or tricky code.
The goal isn't to replace humans: The best future is where AI acts as a tireless assistant that gathers evidence and proposes theories, while human experts make the final judgment calls.

In short, the paper says: "AI is a fantastic new tool for unlocking the secrets of software, but we need to stop treating it like magic and start understanding exactly how it works, where it fails, and how to use it safely."

1. The Two-Step Dance (The Pipeline)

2. The 22 Different "Cases"

3. The Big Problems (Validity Risks)

4. The Future: From Assistant to Partner

Summary

Technical Summary: SoK: AI-Augmented Binary Reversing

1. Problem Statement

2. Methodology and Scope

3. Key Contributions

4. Key Findings and Results

4.1. Landscape and Maturity (RQ1)

4.2. Validity Risks in AI-Driven Pipelines (RQ2)

4.3. Challenges and Future Directions (RQ3)

5. Significance

SoK: AI-Augmented Binary Reversing

1. The Two-Step Dance (The Pipeline)

2. The 22 Different "Cases"

3. The Big Problems (Validity Risks)

4. The Future: From Assistant to Partner

Summary

Technical Summary: SoK: AI-Augmented Binary Reversing

1. Problem Statement

2. Methodology and Scope

3. Key Contributions

4. Key Findings and Results

4.1. Landscape and Maturity (RQ1)

4.2. Validity Risks in AI-Driven Pipelines (RQ2)

4.3. Challenges and Future Directions (RQ3)

5. Significance

More like this