Diagnosing and Repairing Distributed Routing Configurations Using Selective Symbolic Simulation

This paper presents S^2Sim, a novel system that automatically diagnoses and repairs distributed routing configuration errors by selectively simulating symbolic configuration variants to identify intent-compliant fixes, demonstrating high efficiency across networks ranging from 10 to 1,000 nodes.

Rulan Yang, Gao Han, Hanyang Shao, Xiaoqiang Zheng, Xing Fang, Ziyi Wang, Lizhao You, Ruiting Zhou, Linghe Kong, Ennan Zhai, Qiao Xiang, Jiwu Shu

Published 2026-03-10
📖 5 min read🧠 Deep dive

Imagine you are the captain of a massive fleet of ships (the network). Your goal is to get cargo from Port A to Port B safely, avoiding storms and taking specific scenic routes. You have a massive, complex instruction manual (the routing configuration) that tells every ship exactly where to go.

Sometimes, a typo in the manual causes a ship to crash into a reef or take a wrong turn.

The Problem: "It's Broken, But Why?"

Currently, when the fleet gets lost, there are tools that can say, "Hey, the fleet is lost!" (This is Verification). But they are terrible at saying, "Oh, the error is on page 42, line 3, where the captain forgot to turn left."

Fixing the manual usually requires a human expert to read thousands of pages, guess what went wrong, and try to fix it. It's slow, frustrating, and prone to human error.

The Solution: S2Sim (The "Magic Spellchecker")

The paper introduces S2Sim, a new system that doesn't just find the error; it diagnoses the exact typo and repairs the manual automatically.

Here is how it works, using a simple analogy:

1. The "What If" Game (The Contract)

Instead of staring at the broken manual and trying to guess what's wrong, S2Sim plays a game of "What If?"

  • The Goal: It asks, "What would the perfect, error-free manual look like that is almost identical to the one we have?"
  • The Contract: It creates a set of "Golden Rules" (called Contracts). These are simple promises like: "Ship C must always talk to Ship B," or "Ship F must never take the route through the storm."
  • The Insight: If the current manual breaks these Golden Rules, that's where the error is!

2. The "Ghost Ship" Simulation (Symbolic Simulation)

Now, S2Sim runs a simulation. But instead of just running the broken manual, it runs a "Ghost Ship" version.

  • Imagine you are watching a movie of the fleet moving.
  • When the real ships (the broken manual) try to do something wrong (like Ship C refusing to talk to Ship B), the Ghost Ship steps in and says, "No, no! Remember the Golden Rule? You must talk to Ship B."
  • The Ghost Ship forces the fleet to follow the Golden Rules.
  • The Magic: Every time the Ghost Ship has to "force" a change, it puts a red flag on the specific line in the manual that caused the problem.
    • Example: "Ah, the Ghost Ship had to override the manual here. The error is in the 'Export Policy' on Ship C."

3. The "Patch" (Repair)

Once S2Sim has found all the red flags, it doesn't just tell you the problem. It writes the fix.

  • It uses a smart template system. Instead of rewriting the whole manual, it inserts a tiny, precise patch.
  • Analogy: If the error was "Ship F prefers the stormy route," the patch doesn't delete the whole rulebook. It just adds a tiny note: "Ignore the stormy route for this specific cargo."
  • It ensures that fixing one problem doesn't accidentally break another part of the fleet.

Why is this a Big Deal?

1. It Handles the "Big Mess" (Complexity)
Real networks are messy. They have:

  • ACLs: Like bouncers at a club checking IDs.
  • Route Aggregation: Grouping many small streets into one big highway.
  • Multi-path: Taking two roads at once to avoid traffic.
  • Failures: What if a bridge (link) collapses?

Old tools get confused by this complexity. S2Sim is like a master mechanic who understands that if a bridge collapses, the "Golden Rules" change, and it instantly recalculates the best detour without panicking.

2. It's Fast
The researchers tested S2Sim on networks with up to 1,000 nodes (a huge fleet).

  • Old Tools: Could take hours or get stuck forever.
  • S2Sim: Fixed real-world errors in 20 seconds and even massive synthetic networks in 15 minutes.

3. It Works with "Real" Mistakes
They tested it on actual configuration files from major cloud providers and telecom companies. It found and fixed the exact same types of mistakes that human engineers struggle with, like "I forgot to tell the router to talk to its neighbor" or "I set the wrong priority for this route."

The Bottom Line

Think of S2Sim as a self-driving car for network engineers.

  • Old way: You drive the car, it hits a wall, you get out, read the manual, guess why, and try to fix it.
  • S2Sim way: The car senses it's about to hit a wall, instantly calculates the perfect path to avoid it, and rewrites its own GPS instructions so it never makes that mistake again.

It turns a days-long, headache-inducing debugging session into a quick, automated fix, ensuring the internet stays connected and safe.