Topologically-based parameter inference for agent-based model selection from spatiotemporal cellular data

This paper introduces TOPAZ, a novel computational pipeline that integrates topological data analysis with Bayesian inference methods to effectively infer parameters and select the most biologically plausible agent-based models from spatiotemporal single-cell imaging data.

Wenzel, A. R., Haughey, P. M., Nguyen, K. C., Nardini, J. T., Haugh, J. M., Flores, K. B.

Published 2026-03-11
📖 5 min read🧠 Deep dive
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you are watching a massive, chaotic dance floor filled with thousands of people (cells). From a bird's-eye view, you see them swirling, clustering, and moving in streams. You want to know: What are the invisible rules they are following to create this beautiful chaos?

Are they just bumping into each other and bouncing away? Or are they also holding hands and trying to face the same direction as their neighbors?

This is the challenge scientists face when studying cell movement. They have amazing cameras that can track every single cell, but turning that video into a clear set of "rules" is incredibly hard.

This paper introduces a new tool called TOPAZ (which stands for TOpologically-based Parameter inference for Agent-based model optimiZation). Think of TOPAZ as a super-smart detective that helps figure out the hidden rules of the cellular dance floor.

Here is how it works, broken down into simple steps:

1. The Problem: Too Much Data, Too Many Guesses

Scientists have two main tools, but both have flaws:

  • The "Agent-Based Model" (ABM): This is like a video game simulator. You write down rules (e.g., "if a cell gets too close to another, it pushes away") and watch the simulation run. The problem? There are so many possible rules and settings that it's like trying to find a specific needle in a haystack by guessing randomly.
  • Topological Data Analysis (TDA): This is a way of looking at the shape of the data. Instead of looking at individual cells, it looks at the "holes" and "loops" in the crowd. It's like looking at a cloud and saying, "That looks like a bunny," without worrying about the individual water droplets. It's great at describing what the shape is, but it doesn't tell you why it formed that way.

2. The Solution: TOPAZ (The Detective)

TOPAZ combines these two tools into a single pipeline. It acts like a detective who uses a special magnifying glass (TDA) to compare the real crime scene (the actual cell video) with different suspect theories (the computer simulations).

Here is the step-by-step process of the investigation:

  • Step 1: Taking a "Topological Snapshot"
    Imagine taking a photo of the dance floor and turning it into a wireframe map. This map highlights the shapes: where are the big empty circles? Where are the tight clusters? TOPAZ creates a "Crocker Plot," which is essentially a heatmap showing how these shapes change over time. It's like turning a complex movie into a simple, color-coded scorecard.

  • Step 2: The "Guess and Check" (ABC)
    The detective runs thousands of simulations with different rules.

    • Scenario A: Maybe the cells just push and pull.
    • Scenario B: Maybe they push, pull, and try to align their direction like a school of fish.
      TOPAZ compares the "scorecard" from the real video with the scorecards from the simulations. If a simulation's scorecard looks almost identical to the real one, that simulation gets a "thumbs up."
  • Step 3: The "Simplicity Test" (Model Selection)
    This is the most clever part. Imagine you have two theories:

    1. Theory A: Cells push and pull. (Simple, 2 rules).
    2. Theory B: Cells push, pull, and align. (Complex, 3 rules).

    If Theory B only explains the data slightly better than Theory A, TOPAZ says, "No thanks, that extra rule isn't worth the complexity." It uses a mathematical rule called BIC (Bayesian Information Criterion) to penalize overly complicated theories. It only picks the complex theory if the data demands it.

3. The Experiment: The Fibroblast Dance

The authors tested TOPAZ using data from fibroblasts (a type of cell involved in wound healing).

  • They created two versions of a simulation: one where cells just bumped into each other, and one where they also tried to align their direction (like a flock of birds).
  • They fed the "alignment" simulation data into TOPAZ and asked, "Which rules created this?"
  • The Result: TOPAZ correctly identified that the "alignment" rule was necessary. It realized that the simple "bump and push" model couldn't explain the specific shapes and streams seen in the data.

The Big Picture Takeaway

Think of TOPAZ as a translator.

  • On one side, you have biologists with high-tech cameras showing complex cell movements.
  • On the other side, you have mathematicians with abstract equations trying to describe those movements.

TOPAZ translates the visual "shape" of the cell movement into a specific set of mathematical rules. It tells us not just that the cells are moving in a certain way, but exactly which biological interactions (like pushing, pulling, or aligning) are responsible for it.

Why does this matter?
If we can figure out the rules of how cells move, we can better understand how wounds heal, how tumors spread, or how tissues form. TOPAZ gives scientists a reliable way to test their theories against real-world data, ensuring they aren't just making up complicated stories that don't actually fit the facts.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →