FreqEdit: Preserving High-Frequency Features for Robust Multi-Turn Image Editing

FreqEdit is a training-free framework that mitigates quality degradation in multi-turn image editing by preserving high-frequency information through feature injection, adaptive spatial modulation, and path compensation, thereby achieving superior identity preservation and instruction following across numerous consecutive iterations.

Yucheng Liao, Jiajun Liang, Kaiqian Cui, Baoquan Zhao, Haoran Xie, Wei Liu, Qing Li, Xudong Mao

Published 2026-03-24
📖 4 min read☕ Coffee break read

Imagine you are a digital artist using a magical paintbrush to edit a photo. You want to change the background, then add a hat, then change the shirt color, then make the person smile.

The Problem: The "Blurry Copy" Effect
With current AI tools, the first edit looks great. But if you keep editing the result of the previous edit (like a game of "telephone"), the image starts to rot.

  • The Face: The person's face starts to melt or look like a different person.
  • The Skin: The skin turns into a smooth, plastic mannequin because all the tiny pores and texture vanish.
  • The Edges: The outlines of objects get jagged and weirdly sharp.

The paper calls this "High-Frequency Loss." Think of an image like a song. The "low frequencies" are the bass and melody (the big shapes, the colors, the general idea). The "high frequencies" are the high-pitched cymbals and crisp details (skin pores, hair strands, fabric texture).

Current AI tools are like a bad microphone that only picks up the bass. Every time you edit, it loses a little more of the "cymbals." After 10 edits, the song is just a dull thud.

The Solution: FreqEdit (The "Detail Preserver")
The authors created a new tool called FreqEdit. It doesn't need to be retrained; it just works on top of existing AI models. It solves the problem with three clever tricks, using a metaphor of restoring a faded photocopy:

1. The "Reference Blueprint" (High-Frequency Injection)

Imagine you have a pristine, high-quality original photo (the "Context Image"). Every time you make a new edit, FreqEdit looks at that original photo and says, "Hey, look at how sharp the eyelashes are in the original. Let's steal that sharpness and paste it onto our new edit."

  • How it works: It uses a mathematical tool called a Wavelet Transform (think of it as a "frequency filter") to separate the "boring big shapes" from the "exciting tiny details." It takes the tiny details from the original and injects them into the new image before the AI finishes drawing it.

2. The "Smart Mask" (Adaptive Injection)

Here's the tricky part: You don't want to steal details from the original if you are trying to change something.

  • Scenario: You want to change a red car to a blue truck.
  • The Mistake: If you just paste the original details everywhere, the AI might try to keep the red car's shape and the blue truck's shape, resulting in a weird red-blue monster.
  • The Fix: FreqEdit is smart. It looks at where you are editing.
    • If you are changing the background: It says, "Okay, I'll steal the details for the background to keep it crisp."
    • If you are changing the car: It says, "Stop! Don't steal the car's details, or the new blue truck won't form. Let the AI create new details for the truck."
    • It creates a dynamic map that decides exactly how much "detail stealing" to do in every square inch of the image.

3. The "Course Correction" (Path Compensation)

Sometimes, if you force too many details back into the image, the AI gets confused. It might start drawing a "ghost" version of the old object and the new object at the same time (like a double-exposure photo).

  • The Fix: FreqEdit acts like a GPS. Every few steps of the drawing process, it checks: "Are we still heading toward the user's instruction?" If the AI has drifted too far because of all the detail injection, FreqEdit gently nudges the drawing back onto the correct path, ensuring the final result actually looks like what you asked for.

The Result

Without FreqEdit, a 10-step editing session looks like a blurry, melted mess.
With FreqEdit, you can edit the image 10, 15, or even 20 times, and the person still looks like the same person, the skin still has texture, and the edges stay crisp.

In Summary:
FreqEdit is like a digital bodyguard for your photos. While the AI tries to simplify the image (which causes the "melting"), FreqEdit constantly whispers, "No, keep the details! Remember the original texture! Don't let the face melt!" ensuring your photo stays sharp and true to life, no matter how many times you edit it.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →