SPEED: Scalable, Precise, and Efficient Concept Erasure for Diffusion Models

The paper introduces SPEED, an efficient concept erasure framework for diffusion models that directly edits parameters within a null space—enhanced by influence-based filtering, directed prior augmentation, and invariant equality constraints—to achieve scalable, precise removal of multiple concepts in seconds while preserving the quality of non-target generations.

Ouxiang Li, Yuan Wang, Xinting Hu, Houcheng Jiang, Yanbin Hao, Fuli Feng

Published 2026-03-03
📖 5 min read🧠 Deep dive

Imagine you have a giant, incredibly talented artist named Diffusion. This artist has seen almost every image on the internet and can draw anything you describe, from "a cat in a hat" to "a portrait of a specific celebrity."

However, there's a problem. Sometimes this artist draws things you don't want them to draw. Maybe they keep drawing a specific copyrighted character (like Snoopy), or they keep generating images of a specific celebrity who didn't consent, or even inappropriate content. You want to tell the artist, "Please forget how to draw Snoopy," but you don't want to accidentally make them forget how to draw other dogs, or how to draw other cartoon characters like Mickey Mouse.

This is the challenge of Concept Erasure.

The Old Ways: The "Heavy Hammer" and the "Scissors"

Before this new paper, there were two main ways to fix the artist:

  1. The Heavy Hammer (Training-based): You take the artist back to school for weeks. You show them thousands of pictures and say, "Don't draw Snoopy!" This works, but it takes forever (days or weeks) and costs a lot of money. It's like re-educating a whole person just to stop them from saying one word.
  2. The Scissors (Editing-based): You try to surgically cut out the "Snoopy" part of the artist's brain. This is fast, but the old scissors were clumsy. If you tried to cut out 100 different things at once, the artist would get confused and start drawing weird, distorted versions of everything else. They might forget how to draw a "dog" entirely because you messed up the "Snoopy" part too much.

The New Solution: SPEED

The paper introduces SPEED (Scalable, Precise, and Efficient). Think of SPEED as a Magic Eraser that doesn't just rub things out; it rewrites the artist's brain in a very specific, safe way.

Here is how it works, using simple analogies:

1. The "Safe Zone" (Null Space)

Imagine the artist's brain is a giant library of knowledge. When you want to erase "Snoopy," you don't want to knock over the shelves holding "Mickey Mouse" or "Hello Kitty."

SPEED finds a "Safe Zone" (called a Null Space). This is a special direction in the library where you can move things around without disturbing any other books.

  • The Problem: If you try to erase 100 things at once, the "Safe Zone" gets tiny. It's like trying to walk through a crowded room without bumping into anyone; the more people there are, the harder it is to find a clear path.
  • The SPEED Fix: SPEED is smart about who it asks to move. It doesn't try to protect everyone equally. It focuses only on the people who are most likely to get bumped.

2. The Three Magic Tricks (Prior Knowledge Refinement)

To make this "Safe Zone" work even when erasing 100 celebrities, SPEED uses three clever tricks:

  • Trick #1: The "Who Cares?" Filter (Influence-based Prior Filtering)
    Imagine you are erasing "Snoopy." You ask the artist, "If I change the Snoopy instructions, does it change how you draw a 'Pikachu'?"

    • If the answer is "No, Pikachu stays the same," SPEED ignores Pikachu. It doesn't need to protect Pikachu because it's safe.
    • If the answer is "Yes, Pikachu gets distorted," SPEED puts Pikachu in the "Protect Me" list.
    • Why this helps: By ignoring the things that aren't affected, SPEED keeps the "Safe Zone" big enough to work with, even when erasing 100 things.
  • Trick #2: The "Practice Run" (Directed Prior Augmentation)
    Sometimes, just protecting the exact word "Mickey" isn't enough. What if someone asks for "a cartoon mouse"?
    SPEED creates "practice versions" of the things it wants to protect. It takes "Mickey" and creates slight, safe variations (like "Mickey with a hat," "Mickey in a sketch"). It teaches the artist: "Remember, all these versions of Mickey must stay safe."

    • Crucial Detail: These aren't random scribbles. They are carefully crafted variations that stay true to the original meaning, ensuring the artist doesn't get confused.
  • Trick #3: The "Anchor Points" (Invariant Equality Constraints)
    Some parts of the artist's brain are the "glue" that holds everything together (like the concept of "a face" or "a background"). SPEED identifies these glue parts and locks them in place with a digital padlock. It says, "No matter what we erase, these specific anchors must never move." This prevents the whole picture from falling apart.

The Results: Fast, Clean, and Scalable

The paper shows that SPEED is a game-changer:

  • Speed: It can erase 100 celebrities in just 5 seconds. The old methods would take hours or even days. That's a 350x speedup.
  • Precision: If you tell SPEED to erase "Snoopy," it erases Snoopy perfectly. But if you ask for "Hello Kitty" or "SpongeBob," they look exactly the same as before. The old methods often made Hello Kitty look weird or distorted when trying to erase Snoopy.
  • Scalability: It works just as well for 1 concept as it does for 100. You don't need to redesign the whole system; it just scales up.

The Bottom Line

SPEED is like a highly skilled librarian who can remove 100 specific books from a library in seconds without knocking over a single other book or damaging the shelves. It solves the problem of "how do we stop AI from drawing bad things" without breaking the AI's ability to draw good things.

It's fast, it's precise, and it's ready to be used in the real world to make AI safer and more respectful of privacy and copyright.