SFIBA: Spatial-based Full-target Invisible Backdoor Attacks

The paper proposes SFIBA, a spatial-based full-target invisible backdoor attack that ensures trigger specificity and stealthiness in black-box settings by restricting triggers to local spatial regions and employing a frequency-domain injection method, thereby achieving high attack performance while evading existing defenses.

Yangxu Yin, Honglong Chen, Yudong Gao, Peng Sun, Zhishuai Li, Weifeng Liu

Published Tue, 10 Ma
📖 5 min read🧠 Deep dive

Imagine you have a very smart robot chef (a Deep Neural Network) that can perfectly identify ingredients like "tomato," "carrot," or "onion." Now, imagine a hacker wants to trick this robot.

In the past, hackers could only teach the robot one trick: "If you see a tiny red dot, call it a 'carrot'." This is a Single-Target Backdoor. It's useful, but limited. If the hacker wants the robot to call a "tomato" a "carrot" later, they have to retrain the whole robot, which is slow and obvious.

This paper introduces a new, super-sneaky trick called SFIBA (Spatial-based Full-target Invisible Backdoor Attack). Think of it as teaching the robot every possible trick at once, without it ever noticing.

Here is how SFIBA works, broken down into simple concepts:

1. The Goal: The "Master Switch"

Instead of teaching the robot just one trick, the hacker wants to create a Master Switch.

  • Old Way: You can only make the robot think a "dog" is a "cat."
  • SFIBA Way: You can make the robot think a "dog" is a "cat," a "car," or a "banana," depending on which secret signal you use. And you can do this for every single category the robot knows, all at the same time.

2. The Problem: The "Crowded Room"

The biggest problem with doing this is interference.
Imagine trying to whisper a secret to 100 different people in a crowded room. If you shout all the secrets at once, no one hears anything clearly.

  • In AI terms, if you try to inject too many "triggers" (secrets) into the training data, they start fighting each other. The robot gets confused, the tricks stop working, or the changes become visible (like a giant red dot on a picture), alerting the defenders.
  • Also, the hacker doesn't get to see the robot's brain (it's a "Black Box"). They can only change the food (training data) they give the robot, not how the robot thinks.

3. The Solution: "Zoning" and "Invisible Ink"

SFIBA solves this with two main ideas: Spatial Zoning and Frequency Domain Magic.

A. Spatial Zoning (The "Post-it Note" Strategy)

Instead of shouting the whole room, the hacker assigns a specific, tiny zone for each secret.

  • Imagine the image is a large wall.
  • For the "Dog-to-Cat" trick, the secret is hidden in the top-left corner.
  • For the "Dog-to-Car" trick, the secret is hidden in the bottom-right corner.
  • For the "Dog-to-Banana" trick, it's in the middle.
  • Why it works: Because the secrets are in different, non-overlapping corners, they don't bump into each other. The robot learns to look at the top-left corner for one trick and the bottom-right for another. This allows the hacker to control every class without the tricks canceling each other out.

B. Frequency Domain Magic (The "Invisible Ink" Strategy)

Now, how do you hide the secret in that corner without the robot (or a human) seeing it?

  • The Problem: If you just paint a dot on the wall, everyone sees it.
  • The SFIBA Trick: Instead of painting on the "surface" (pixels), the hacker changes the vibrations of the wall.
    • Think of an image like a song. It has a melody (what you see) and a rhythm (the hidden frequencies).
    • SFIBA uses a mathematical tool called FFT (Fast Fourier Transform) to turn the image into a song.
    • It then uses Wavelets (like a super-precise microscope) to find the specific "notes" in the song that correspond to the secret.
    • It tweaks these notes slightly. To your eyes, the song sounds exactly the same. The image looks identical. But to the robot, the "vibration" of that specific corner has changed, triggering the secret command.

4. The "Shape-Shifter" (Morphology Constraints)

To make sure the robot doesn't get confused if the image is rotated or flipped (like a picture of a dog turned sideways), SFIBA gives each secret a unique shape.

  • The "Dog-to-Cat" secret in the top-left corner is shaped like a horizontal line.
  • The "Dog-to-Car" secret in the bottom-right is shaped like a vertical line.
  • Even if the image moves, the robot knows: "Ah, I see a horizontal line in the top-left, so I must call this a cat." This keeps the tricks distinct and robust.

5. The "Dynamic Tuner" (The Volume Knob)

Finally, the system has a smart volume knob.

  • If the secret is too loud, the robot might notice the image looks weird.
  • If it's too quiet, the robot won't hear the command.
  • SFIBA automatically adjusts the "volume" (injection coefficient) for every single image to ensure it's just loud enough to work, but quiet enough to remain invisible. It checks the "quality score" (PSNR) and fine-tunes until it's perfect.

The Result: The Perfect Heist

The paper shows that SFIBA is incredibly effective:

  1. Full Control: It can hijack every class in the robot's brain, not just one.
  2. Invisible: Humans and standard security tools cannot see the difference between a clean image and a poisoned one.
  3. Stealthy: It bypasses current security defenses that try to find backdoors.
  4. Black-Box Friendly: The hacker doesn't need to know how the robot works; they just need to feed it the right "poisoned" food.

In summary: SFIBA is like a master spy who can whisper a different secret to a guard at every single door in a building, using invisible ink and specific hand signals, without ever getting caught or causing a panic. It turns a single-target trick into a full-building takeover.