Solving Jigsaw Puzzles in the Wild: Human-Guided Reconstruction of Cultural Heritage Fragments

This paper proposes a human-in-the-loop framework that combines an automatic relaxation-labeling solver with interactive guidance strategies to effectively and efficiently reassemble large-scale, fragmented cultural heritage artifacts in real-world conditions where traditional methods fail.

Omidreza Safaei, Sinem Aslan, Sebastiano Vascon, Luca Palmieri, Marina Khoroshiltseva, Marcello Pelillo

Published 2026-03-09
📖 4 min read☕ Coffee break read

Imagine you are handed a massive, ancient jigsaw puzzle. But this isn't a clean, store-bought puzzle with bright pictures and perfect edges. This is a thousand-piece archaeological mystery where:

  • The pieces are broken, eroded, and missing chunks.
  • The picture is faded, and many pieces look exactly like their neighbors.
  • There are thousands of pieces, and some might even belong to different puzzles mixed together.

If you try to solve this alone, you might stare at it for days. If you give it to a standard computer program, it will likely get confused, make a mess, and give up.

This paper introduces a super-team approach: a "Human-in-the-Loop" system where a smart computer and a human expert work together like a dance partner to solve the puzzle.

The Problem: The Computer Gets Lost

Think of a standard computer solver as a very fast but slightly confused robot.

  • It looks at two pieces and says, "Hey, these edges look a little similar! Let's snap them together!"
  • But because the pieces are worn out, the robot makes mistakes. It might glue two pieces together that almost fit but aren't quite right.
  • Once it makes a mistake, it gets stuck in a "local trap," thinking it has solved the puzzle when it's actually just a jumbled mess. It can't see the big picture.

The Solution: The "Anchor" Strategy

The authors propose a system where the computer does the heavy lifting, but a human acts as the GPS navigator.

Here is how their two main strategies work, using simple analogies:

1. The "Anchor" Method (Iterative Anchoring)

Imagine you are building a sandcastle. You don't try to build the whole castle at once.

  1. Pick a Base: You find one perfect piece (the "Anchor") and lock it firmly in place.
  2. Build Around It: The computer looks only at the pieces that could possibly fit next to that one locked piece. It suggests a few neighbors.
  3. The Human Check: You look at the suggestions. "Yes, that one fits!" or "No, that's wrong."
  4. Lock and Repeat: Once you say "Yes," that new piece becomes a new Anchor. Now the computer looks for pieces to fit around the new anchor.

The Magic: By locking in the correct pieces one by one, you give the computer a stable foundation. It stops guessing wildly and starts building a solid structure, piece by piece.

2. The "Global Refinement" Method (Continuous Interactive Refinement)

This is like having a bird's-eye view of the whole puzzle.

  • The computer tries to arrange the entire puzzle at once.
  • You watch the screen. If you see a section that looks weird or disconnected, you pause the computer.
  • You grab a piece, drag it to the right spot, and say, "Stay there."
  • The computer then re-calculates the whole puzzle, using your correction as a new rule.

Why This is a Game-Changer

The paper tested this on real, messy ancient frescoes (wall paintings) that had been broken into thousands of fragments.

  • The Computer Alone: Failed. It created fragmented, nonsensical clusters.
  • The Human Alone: Took forever. It's too much work for a person to check thousands of pieces manually.
  • The Team (Human + Computer): Won.
    • The computer was fast at finding potential matches.
    • The human was fast at spotting obvious errors and confirming the right ones.
    • Together, they solved the puzzle with much higher accuracy and much less time than either could alone.

The Big Picture

This isn't just about puzzles. It's about restoring history.
Archaeologists often have boxes full of broken pottery or wall fragments from ancient sites. They need to put them back together to understand the past. This system is like giving archaeologists a super-powered pair of glasses that helps them see the connections the computer misses, while the computer does the math the human can't do.

In short: Don't let the computer drive the car alone, and don't let the human drive without a map. Put them in the car together, with the human holding the steering wheel and the computer handling the engine, and they can navigate even the roughest, most broken roads of history.