Tracing Copied Pixels and Regularizing Patch Affinity in Copy Detection

This paper proposes a novel image copy detection framework that combines a pixel coordinate tracking module (PixTrace) with a geometrically-guided contrastive loss (CopyNCE) to enhance fine-grained correspondence learning and achieve state-of-the-art performance on the DISC21 dataset.

Yichen Lu, Siwei Nie, Minlong Lu, Xudong Yang, Xiaobo Zhang, Peng Zhang

Published 2026-02-26
📖 4 min read☕ Coffee break read

Imagine you are a detective trying to solve a mystery: Did someone steal a photo and edit it to look like their own?

In the world of digital images, "Copy Detection" is the job of spotting these stolen, edited photos. For a long time, computers were good at finding exact duplicates (like a photocopy of a photocopy), but they struggled when the thief made clever changes—like cropping a picture, changing the colors, or even cutting out a specific object and pasting it somewhere else.

This paper introduces a new detective team called PixTrace and CopyNCE. Here is how they work, explained simply:

1. The Problem: The "Blind" Detective

Previous AI detectives tried to solve this by looking at the whole picture at once. They would say, "These two photos look 80% similar, so they must be related."

  • The Flaw: If a thief takes a photo of a cat, crops out the tail, and adds a hat, the "whole picture" approach gets confused. It might miss the connection because the overall look changed too much.
  • The Old Way (Heuristic Matching): Some older methods tried to look at small patches (like puzzle pieces) and guess which ones matched. But they were like a detective guessing, "This patch looks kinda like that one," which led to a lot of false alarms (accusing innocent photos) or missed clues.

2. The Solution: The "Digital Paper Trail" (PixTrace)

The authors realized something brilliant: Every edit leaves a trace.

Imagine you have a transparent sheet of paper with a drawing on it (the original image).

  • If you stretch the paper, the drawing stretches.
  • If you cut a piece out, you know exactly where that piece came from.
  • If you rotate it, you know the new angle.

PixTrace is a system that keeps a digital logbook of every single pixel.

  • When the AI "edits" an image to create a training example, it doesn't just save the new picture. It saves a map that says: "Pixel #500 in the new image came from Pixel #500 in the original image."
  • Even if the image is rotated, zoomed, or color-shifted, this logbook knows exactly where every pixel traveled. It's like having a GPS tracker for every single dot of color in the image.

3. The New Training Method: The "Strict Teacher" (CopyNCE)

Now that they have this perfect map (PixTrace), they needed a way to teach the AI to use it. Enter CopyNCE.

Think of the AI as a student taking a test.

  • Old Teachers: Would show the student two photos and say, "These are similar." But they didn't explain why or which parts matched. The student would guess, often getting it wrong.
  • The CopyNCE Teacher: Uses the PixTrace logbook to be incredibly specific. It points to a patch on the "stolen" image and says, "Look! This specific patch here corresponds to this specific patch there. They overlap by 40%."

It forces the AI to learn the geometry of the theft. It teaches the AI: "Don't just guess that the whole image is similar. Prove that the specific pieces fit together like a puzzle."

4. The Result: A Super-Detective

Because the AI learned to track the "footprints" of pixels, it became incredibly good at spotting even the most sophisticated edits.

  • Performance: On a major test (the DISC21 competition), this new method became the world champion. It found stolen images that other methods missed.
  • Interpretability: Unlike other "black box" AI models that just give a "Yes/No" answer, this system can actually show you where the copy happened. It can highlight the specific squirrel in a photo that was copied from another image, proving it knows exactly what it's looking at.

Summary Analogy

Imagine trying to find a specific page torn out of a book and pasted into another book.

  • Old AI: Looks at the two books and says, "They look similar enough." (Often wrong).
  • PixTrace + CopyNCE: Keeps a record of every page number. It sees the torn page, tracks its page number, and says, "Aha! Page 42 in Book B is definitely Page 42 from Book A, even though the ink color changed."

By following the "footprints" of the pixels, this new method makes it nearly impossible for image thieves to hide their edits.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →