IDperturb: Enhancing Variation in Synthetic Face Generation via Angular Perturbation

This paper introduces IDperturb, a geometric-driven sampling strategy that enhances intra-class variation in synthetic face generation by perturbing identity embeddings within a constrained angular region, thereby improving the performance and generalizability of face recognition systems trained on the resulting diverse datasets.

Fadi Boutros, Eduarda Caldeira, Tahar Chettaoui, Naser Damer

Published 2026-02-24
📖 4 min read☕ Coffee break read

Imagine you are trying to teach a robot to recognize your face. To do this, you need to show it thousands of pictures of you: smiling, frowning, looking left, looking right, in bright sunlight, and in the dark.

The problem? Privacy laws and ethical concerns mean we can't just grab millions of real photos of real people to train these robots anymore. So, scientists use AI to generate fake (synthetic) faces instead.

Here is the catch: Current AI face generators are like stuck-on-repeat records. If you ask them to generate "Person A," they make a picture of Person A. If you ask again, they make almost the exact same picture again. They are too perfect and too identical.

If you train a robot on these identical fake faces, the robot gets confused when it sees a real person who looks slightly different (maybe they have a different haircut or are squinting). The robot fails because it never learned to handle variation.

Enter IDPERTURB: The "Slight Twist" Strategy

The paper introduces a clever, simple trick called IDPERTURB. Think of it not as changing the person, but as shaking the camera slightly while keeping the subject the same.

Here is how it works, using a few analogies:

1. The "Identity Fingerprint" (The Embedding)

Every face, when analyzed by a computer, gets a unique mathematical "fingerprint" (called an embedding). Imagine this fingerprint is a point on a giant, invisible globe.

  • If you have a point for "You," and another point for "Your Twin," they are close together but not on top of each other.
  • Current AI generators usually pick one single point on this globe and say, "Generate a face for this exact point." The result? A boring, repetitive face.

2. The "Cone of Possibility" (Angular Perturbation)

IDPERTURB says, "Let's not pick just one point. Let's pick a cone-shaped area around that point."

  • Imagine your "You" point is the tip of an ice cream cone.
  • Instead of staring at the tip, IDPERTURB picks random spots inside the cone.
  • These spots are still very close to "You" (so it's still clearly you), but they are slightly different angles.

3. The Result: A Family of Variations

When the AI generates a face using a point from inside that cone, it creates a picture of You, but maybe:

  • You are tilting your head slightly.
  • You are squinting a tiny bit.
  • The lighting feels a little different.
  • Your expression is slightly more relaxed.

It's like taking a photo of yourself, then taking 50 more photos where you just shift your weight, blink differently, or turn your head an inch. You are still unmistakably you, but the photos are diverse enough to teach the robot how to recognize you in the real world.

Why is this a big deal?

The "Goldilocks" Zone:

  • Too little change: The robot gets bored and fails to recognize real people (The "Stuck Record" problem).
  • Too much change: The robot thinks the new photo is a different person entirely (The "Identity Crisis").
  • IDPERTURB: It finds the perfect middle ground. It creates enough variety to make the robot smart, but keeps the changes small enough that the robot knows it's still the same person.

The Analogy of the Art Class

Imagine an art teacher asking students to draw "A Cat."

  • Old Method: The teacher gives the students a single, perfect photo of a cat. Everyone draws the exact same cat. When the teacher shows a real cat with a missing ear, the students are confused.
  • IDPERTURB Method: The teacher gives the students a photo of a cat, but says, "Draw this cat, but imagine it's stretching, or sleeping, or looking at a bird." The students draw many different versions of the same cat. Now, when the teacher shows a real cat, the students recognize it immediately because they've seen many variations.

The Bottom Line

IDPERTURB is a simple, geometric trick that makes synthetic faces less robotic and more realistic without needing to rebuild the AI from scratch. By slightly "wiggling" the mathematical coordinates of a face before generating it, the researchers created training data that is diverse enough to make face-recognition systems much smarter, more robust, and better at handling the messy reality of human faces.

It's a win for privacy (no real photos needed) and a win for technology (smarter AI).

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →