Unmixing microinfrared spectroscopic images of cross-sections of historical oil paintings

This paper proposes an unsupervised CNN autoencoder with a novel weighted spectral angle distance loss to enable blind, automated unmixing of complex ATR-μ\muFTIR hyperspectral images from historical oil painting cross-sections, significantly improving the interpretability and scalability of material analysis compared to traditional manual methods.

Shivam Pande, Nicolas Nadisic, Francisco Mederos-Henry, Aleksandra Pizurica

Published Tue, 10 Ma
📖 5 min read🧠 Deep dive

Imagine you have a very old, precious painting, like the famous Ghent Altarpiece. Over hundreds of years, the paint has cracked, faded, and changed. To understand how it was made and how to fix it, scientists take a tiny, microscopic slice of the painting (a "cross-section"). This slice is like a layered sandwich: the top layer might be varnish, the middle layer paint, and the bottom layer the canvas.

Inside this tiny slice, there are dozens of different ingredients mixed together: pigments, oils, binders, and even new chemicals formed by age (like rust or soap).

The Problem: The "Smoothie" Mystery

Scientists use a special machine called ATR-µFTIR to look at this slice. This machine shines infrared light on every single tiny dot (pixel) of the slice and records a "fingerprint" (a spectrum) of what's there.

However, there's a catch. In a real painting, the layers are messy. At any single dot, the machine doesn't just see "Red Paint." It sees a smoothie made of Red Paint, Blue Paint, Oil, and maybe some dust or water vapor from the air.

The machine records this messy mix as a single, complex signal. The goal of the scientists is to "unmix" this smoothie. They want to figure out:

  1. What were the original ingredients? (The Endmembers)
  2. Where exactly are they located in the slice? (The Abundance Maps)

Traditionally, experts had to do this by hand, comparing the messy signals to a library of known chemicals. It was slow, subjective, and often missed things.

The Solution: A Smart AI Detective

The authors of this paper built a new AI tool called FTIR-unmixer. Think of it as a super-smart detective that can look at the messy smoothie and automatically separate the ingredients.

Here is how it works, using simple analogies:

1. The "Patch" Strategy (Looking at Neighborhoods)

Instead of looking at one tiny dot in isolation, the AI looks at a small neighborhood of dots (a 5x5 patch) at a time.

  • Analogy: Imagine trying to identify a person in a crowd. If you look at one face, it's hard. But if you look at the whole group and see who is standing next to whom, it becomes much easier to spot patterns. The AI uses this "neighborhood" logic to understand that if a certain chemical is here, it's likely nearby too.

2. The "Autoencoder" (The Compression Game)

The AI uses a neural network called an Autoencoder.

  • Analogy: Imagine you have a huge, messy suitcase full of clothes (the data). You want to pack it into a tiny, neat box (the ingredients).
    • The Encoder squishes the messy suitcase down to figure out the essential items inside (the ingredients).
    • The Decoder tries to unpack those items to rebuild the suitcase.
    • If the AI can rebuild the suitcase perfectly, it knows it found the right ingredients.

3. The "Weighted Spectral Angle" (Ignoring the Noise)

This is the paper's biggest innovation. The data from the machine is full of "noise."

  • The Problem: Sometimes, the machine picks up signals from the air (like Carbon Dioxide or water vapor) or glitches in the sensor. These are like static on a radio. If the AI listens to the static, it might think the static is a new ingredient, which ruins the analysis.
  • The Fix: The authors created a special rule called WSAD (Weighted Spectral Angle Distance).
    • Analogy: Imagine you are trying to hear a conversation in a noisy room. You naturally tune out the sound of the air conditioner or the traffic outside because you know they aren't part of the conversation.
    • The AI does the same thing. It automatically checks every "channel" of data. If a channel looks weird, spiky, or too flat (like the air pollution), the AI turns down the volume on that channel. It only listens closely to the channels that look like real chemical signals.

The Result: Cleaning Up the Altarpiece

The team tested this on a real slice from the Ghent Altarpiece.

  • Without the new rule (Standard AI): The AI got confused by the air pollution (CO2) and thought it was a real part of the painting. The map of where the "metal soap" (a degradation product) was located looked a bit scattered and messy.
  • With the new rule (WSAD): The AI ignored the air pollution. The map of the metal soap became much clearer and matched what experts expected. It successfully separated the "ingredients" (like proteins, metal soaps, and calcium oxalates) from the background noise.

Why This Matters

This isn't just about math; it's about saving history.

  • Speed: It does in minutes what used to take experts hours or days.
  • Accuracy: It removes human bias and doesn't get tricked by air pollution or machine glitches.
  • Discovery: It can find hidden layers or tiny amounts of chemicals that a human eye might miss, helping conservators understand exactly how to restore these priceless artworks without damaging them.

In short, the authors built a smart, noise-canceling AI that can look at a messy, ancient paint slice and perfectly separate the original ingredients, helping us understand and preserve our cultural heritage.