Classification of Histopathology Slides with Persistent Homology Convolutions

This paper introduces Persistent Homology Convolutions, a method that captures local topological features in histopathology slides. Models built on it outperform standard CNNs in classification accuracy and are less sensitive to hyperparameters, which suggests the method brings useful geometric information into deep learning models.

Shrunal Pothagoni, Benjamin Schweinhart

Published 2026-03-05

Imagine you are a detective trying to solve a mystery: Is this tissue sample healthy, or is it cancerous?

In the world of medicine, pathologists (the detectives) look at tiny slides of tissue under a microscope. They don't just look at the colors; they look at the shape and arrangement of the cells. Are the cells crowded together? Are there holes in the tissue? Are the nuclei (each cell's "brain") split into multiple pieces? These geometric clues are vital for diagnosing diseases like osteosarcoma (bone cancer).

For a long time, computers have tried to help with this using AI (Artificial Intelligence), specifically a type called Convolutional Neural Networks (CNNs). Think of a CNN as a very smart robot that scans an image, looking for patterns.

The Problem: The Robot is "Topologically Blind"

Here is the catch: standard AI robots are great at seeing colors and edges, but they can lose track of overall shape and structure.

Imagine you have a photo of a crowd of people.

  • Standard AI: Counts the people. It sees "lots of heads."
  • The Missing Piece: It doesn't understand how they are standing. Are they holding hands in a circle? Are they standing in a chaotic pile? Are there empty spaces between them?

In medical terms, standard AI often misses the "topology" (the study of shapes and holes). If a tumor causes cells to cluster in a weird, specific way, a standard AI might miss it because it's too focused on individual pixels rather than the big picture of the shape.

The Old Solution: The "Global" Summary

Some researchers tried to fix this by giving the AI a "global summary" of the shape.

  • The Analogy: Imagine trying to describe a city to someone who has never seen it. You give them a single statistic: "This city has 500 parks and 1,000 buildings."
  • The Flaw: This tells you how many things there are, but not where they are. A city with parks scattered everywhere feels very different from a city where all the parks are in one giant block. The "global summary" loses the local details that matter.
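
To make the flaw concrete, here is a minimal Python sketch. It is an illustration only, not code from the paper: the two tiny "tissue" grids and the helper names (count_components, quadrant_counts, make_grid) are made up for this example. Both grids contain four separate blobs, so a global count cannot tell them apart, while a per-quadrant count can.

```python
from collections import deque

def count_components(grid):
    """Count 4-connected groups of 1-cells in a binary grid (list of lists)."""
    rows, cols = len(grid), len(grid[0])
    seen = [[False] * cols for _ in range(rows)]
    count = 0
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] == 1 and not seen[r][c]:
                count += 1  # found a new, unvisited blob
                seen[r][c] = True
                queue = deque([(r, c)])
                while queue:  # flood-fill the rest of this blob
                    y, x = queue.popleft()
                    for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and grid[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
    return count

def quadrant_counts(grid):
    """Blob count per quadrant: top-left, top-right, bottom-left, bottom-right."""
    h, w = len(grid) // 2, len(grid[0]) // 2
    counts = []
    for r0 in (0, h):
        for c0 in (0, w):
            window = [row[c0:c0 + w] for row in grid[r0:r0 + h]]
            counts.append(count_components(window))
    return counts

def make_grid(cells, size=8):
    """Build a size x size grid of zeros with the given (row, col) cells set to 1."""
    grid = [[0] * size for _ in range(size)]
    for r, c in cells:
        grid[r][c] = 1
    return grid

# Hypothetical toy data: one blob per corner vs. four blobs packed in one corner.
scattered = make_grid([(1, 1), (1, 6), (6, 1), (6, 6)])
clustered = make_grid([(0, 0), (0, 2), (2, 0), (2, 2)])

print(count_components(scattered), count_components(clustered))  # 4 4
print(quadrant_counts(scattered))  # [1, 1, 1, 1]
print(quadrant_counts(clustered))  # [4, 0, 0, 0]
```

The global number is identical for both grids; only the local counts reveal where the blobs sit.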

The New Solution: Persistent Homology Convolutions (PHC)

The authors of this paper invented a new tool called Persistent Homology Convolutions (PHC).

Think of PHC as a smart, sliding magnifying glass that looks not just at pixels, but at the shapes and holes inside each small region it passes over.

  1. The Sliding Window: Instead of looking at the whole image at once, the AI slides a small window (like a 32x32 pixel square) across the entire slide, just like a person scanning a document with their eyes.
  2. The "Shape Detective": Inside that small window, the AI doesn't just count pixels. It asks:
    • "How many separate groups of cells are here?"
    • "Are there holes (voids) between the cells?"
    • "How long do these groups and holes survive as we sweep a threshold (such as pixel brightness) from low to high?"
  3. The "Fingerprint": For every little window, the AI creates a tiny "fingerprint" (a vector) that describes the shape of that specific area.
  4. The Assembly: It then stitches all these tiny fingerprints together to build a complete map of the tissue's geometry.
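
The four steps above can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the authors' implementation: it only tracks separate groups (0-dimensional persistent homology of dark regions) and ignores holes, the three-number fingerprint is a stand-in for whatever vector the paper actually uses, and the names h0_persistence, fingerprint, and ph_convolution, along with the stride parameter, are hypothetical.

```python
def h0_persistence(patch):
    """Finite (birth, death) pairs of 0-dim sublevel-set persistence for a 2D patch.

    Pixels are added from darkest to brightest. Each new pixel starts a group;
    when two groups touch, the younger one "dies" (the elder rule).
    The single group that never dies is omitted.
    """
    rows, cols = len(patch), len(patch[0])
    order = sorted((patch[r][c], r, c) for r in range(rows) for c in range(cols))
    parent, birth = {}, {}

    def find(p):  # union-find root lookup with path halving
        while parent[p] != p:
            parent[p] = parent[parent[p]]
            p = parent[p]
        return p

    pairs = []
    for value, r, c in order:
        p = (r, c)
        parent[p], birth[p] = p, value
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            q = (r + dr, c + dc)
            if q in parent:  # neighbor already added at a lower or equal value
                a, b = find(p), find(q)
                if a != b:
                    young, old = (a, b) if birth[a] > birth[b] else (b, a)
                    if birth[young] < value:  # skip zero-lifetime groups
                        pairs.append((birth[young], value))
                    parent[young] = old
    return pairs

def fingerprint(patch):
    """Toy fingerprint: (number of bars, total lifetime, longest lifetime)."""
    lifetimes = [death - b for b, death in h0_persistence(patch)]
    return (len(lifetimes), sum(lifetimes), max(lifetimes, default=0))

def ph_convolution(image, window, stride):
    """Slide a window over the image and stitch the fingerprints into a grid."""
    rows, cols = len(image), len(image[0])
    out = []
    for r0 in range(0, rows - window + 1, stride):
        out_row = []
        for c0 in range(0, cols - window + 1, stride):
            patch = [row[c0:c0 + window] for row in image[r0:r0 + window]]
            out_row.append(fingerprint(patch))
        out.append(out_row)
    return out

# Hypothetical 4x8 image: the left half has two dark pits, the right half is flat.
image = [
    [0, 5, 1, 5, 5, 5, 5, 5],
    [5, 5, 5, 5, 5, 5, 5, 5],
    [5, 5, 5, 5, 5, 5, 5, 5],
    [5, 5, 5, 5, 5, 5, 5, 5],
]
print(ph_convolution(image, window=4, stride=4))  # [[(1, 4, 4), (0, 0, 0)]]
```

The left window reports one group that is born at brightness 1 and merges into its neighbor at brightness 5, while the flat right window reports nothing. The output grid keeps track of which window each fingerprint came from, and that spatial bookkeeping is the point of the method. A real pipeline would use a larger window (the 32x32 example above) and a dedicated persistent homology library to capture holes as well.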

Why is this better?

  • It keeps the "Local" context: It knows that a hole in the top-left corner is different from a hole in the bottom-right. It preserves the arrangement of the cells.
  • It's translation invariant: If you shift the image slightly, the AI still recognizes the same shapes.
  • It's efficient: Because it summarizes the shape into a simple "fingerprint" before feeding it to the main AI, the computer doesn't have to crunch as much raw data. It's like summarizing a 500-page book into a 10-page outline before reading it.
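
For a rough feel of the "outline versus book" effect, the arithmetic below counts how many numbers reach the main AI. Every figure here is an assumption chosen for illustration: the 1024x1024 image size, the non-overlapping stride, and the three-number fingerprint do not come from the paper, and the function name fingerprint_map_size is hypothetical.

```python
def fingerprint_map_size(height, width, window, stride, features_per_window):
    """Number of values in the stitched fingerprint map."""
    windows_down = (height - window) // stride + 1
    windows_across = (width - window) // stride + 1
    return windows_down * windows_across * features_per_window

# Hypothetical setup: a 1024x1024 grayscale region, 32x32 windows, no overlap,
# and an assumed fingerprint of 3 numbers per window.
raw_values = 1024 * 1024
summary_values = fingerprint_map_size(1024, 1024, window=32, stride=32,
                                      features_per_window=3)
print(raw_values, summary_values)  # 1048576 3072
```

Overlapping windows or longer fingerprints shrink this gap, and computing the fingerprints has its own cost, so treat the numbers as a back-of-the-envelope picture rather than a benchmark.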

The Results: The Detective Wins

The researchers tested this new method on a dataset of bone cancer slides. They compared three approaches:

  1. Standard AI: Just looking at the raw image.
  2. Old Topology AI: Looking at the whole image's shape summary (the "global" method).
  3. New PHC AI: Using the sliding window shape detective.

The Winner: The New PHC AI won hands down.

  • It achieved 93.9% accuracy (compared to about 91% for standard methods).
  • It was more consistent: its accuracy was less sensitive to hyperparameters (the settings chosen before training begins).
  • It was faster to compute than the old "global" method.

The Big Takeaway

This paper is like upgrading a detective's toolkit. Instead of just giving the detective a list of clues (pixels) or a single summary of the crime scene (global shape), they gave the detective a magnifying glass that highlights the structure of the evidence as they walk through the scene.

By teaching the computer to understand the geometry and arrangement of cells locally, rather than just their colors, we can build AI that diagnoses cancer more accurately, potentially saving more lives.