TokenTrace: Multi-Concept Attribution through Watermarked Token Recovery

TokenTrace is a novel proactive watermarking framework that embeds signatures into text prompt embeddings and initial latent noise to enable robust, query-based disentanglement and attribution of multiple concepts within a single generative AI image, outperforming existing methods in both accuracy and visual quality.

Li Zhang, Shruti Agarwal, John Collomosse, Pengtao Xie, Vishal Asnani

Published 2026-02-24
📖 5 min read🧠 Deep dive

Imagine you are a famous artist who has spent years developing a unique painting style. Suddenly, a powerful AI starts painting pictures that look exactly like your work, but it doesn't give you credit. Worse yet, the AI might mix your style with a picture of a specific dog you trained it on, creating a "hybrid" image. Current tools can't tell who owns the style and who owns the dog; they just see one big, messy picture.

This is the problem TokenTrace solves. It's like a high-tech, invisible detective system for AI-generated art.

Here is how it works, broken down into simple analogies:

1. The Problem: The "Blended Smoothie"

Think of an AI image as a smoothie made from different ingredients.

  • Ingredient A: A specific object (like a "golden retriever").
  • Ingredient B: A specific style (like "Van Gogh's brushstrokes").

Old watermarking methods were like putting a single, giant sticker on the outside of the smoothie cup. If you wanted to know if the "dog" or the "Van Gogh style" was inside, the sticker couldn't tell you. It just said, "This cup has a sticker." If you tried to peel the sticker off to check one ingredient, the whole cup might get ruined.

2. The Solution: Invisible DNA in the Recipe

TokenTrace changes the game. Instead of putting a sticker on the outside (the final picture), it secretly rewrites the recipe (the instructions the AI follows).

Imagine you are baking a cake.

  • The Secret: You want to prove you baked a "Chocolate Cake" and a "Red Velvet Cake" in the same batch.
  • The Old Way: You write "Made by Bob" on the frosting.
  • The TokenTrace Way: You secretly change the flour to have a tiny bit of "Chocolate DNA" and the sugar to have a tiny bit of "Red Velvet DNA."

When the cake is baked, the DNA is baked into the cake itself. You can't see it, and it doesn't change the taste (the picture looks perfect). But because the DNA is in the ingredients, it survives even if you slice the cake, freeze it, or wrap it in foil.

3. How It Works: The "Dual-Conditioning" Strategy

TokenTrace does this by tweaking two things at the same time before the AI starts drawing:

  1. The Text Prompt (The Recipe Card): It slightly alters the words you type (e.g., "a dog") to hide a secret code.
  2. The Initial Noise (The Raw Dough): It slightly changes the random static the AI starts with to hide another part of the code.

By hiding the secret in both the words and the starting noise, the watermark becomes deeply woven into the fabric of the image. It's not just sitting on top; it's part of the image's DNA.

4. The Detective: The "Query-Based" Module

This is the coolest part. When you want to check who owns the image, you don't just scan the whole picture blindly. You use a Query.

Imagine you have a locked box with a mix of gold and silver coins.

  • Old Method: You shake the box and hope to hear a "gold" sound.
  • TokenTrace Method: You hold up a magnet specifically for gold and ask, "Show me the gold!" The system then uses a special lens (the TokenTrace module) to look at the image and say, "Yes, there is a gold coin here!" Then you hold up a magnet for silver and it says, "Yes, there is a silver coin here too!"

In technical terms:

  • You feed the image into the system.
  • You type a question: "Is the dog in this picture?"
  • The system checks the "Dog DNA" hidden in the recipe and confirms it.
  • You then ask: "Is the Van Gogh style in this picture?"
  • The system checks the "Style DNA" and confirms that too.

It can separate the two concepts perfectly, even though they are mixed together in one image.

5. Why This Matters

  • For Artists: It protects your unique style and your specific creations. Even if an AI mixes your style with a celebrity's face, you can prove, "That style is mine," and "That face is theirs."
  • For Everyone: It stops AI from stealing credit. It ensures that when we see a beautiful image, we know exactly who (or what) contributed to its creation.
  • Robustness: Because the secret is baked into the "ingredients" (the text and noise), it survives common tricks like cropping the image, compressing it for the web, or even trying to hack it with AI attacks.

Summary

TokenTrace is like giving every AI-generated image a set of invisible, unbreakable barcodes hidden inside its very DNA. Unlike old methods that put a sticker on the surface, TokenTrace writes the ownership story into the recipe itself. And the best part? You can ask the image specific questions ("Who made the dog?" "Who made the style?") and get a clear, separate answer for each, ensuring everyone gets the credit they deserve.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →