Topological Causal Effects

Imagine you are a doctor trying to figure out if a new medicine works. Usually, you look at simple numbers: Did the patient's temperature go down? Did their blood pressure drop? These are like checking the height or weight of a patient. It's easy to measure, but it only tells you part of the story.

What if the medicine doesn't change a patient's weight, but it completely reshapes their internal organs? What if it turns a solid lump into a hollow ring, or connects two separate islands of tissue into one big continent? Standard math tools (which look at simple numbers) would miss this entirely. They would say, "No change!" because the total amount of tissue is the same, even though the shape is totally different.

This paper introduces a new way to measure cause-and-effect that looks at shape and structure instead of just numbers. The authors call this Topological Causal Effects.

Here is a simple breakdown of how it works, using some creative analogies:

1. The Problem: The "Shape" Blind Spot

In the real world, data is often complex. Think of a brain scan, a protein molecule, or a social network.

Old Way: You measure the "average" of everything. It's like trying to describe a sculpture by only measuring its total volume of clay. You miss the holes, the loops, and the twists.
The Issue: If a treatment changes the structure (like creating a new loop in a protein), the old math tools can't see it. They are "shape-blind."

2. The Solution: Topological Data Analysis (TDA)

The authors use a branch of math called Topological Data Analysis. Think of this as a "Shape Detective."

The Persistence Diagram: Imagine you have a pile of sand. As you slowly pour water over it, islands appear and disappear.
- A small puddle might appear and vanish quickly (a short-lived feature).
- A large mountain might stay above the water for a long time (a persistent feature).
- The "Persistence Diagram" is a map that records every island (loop, hole, or connected piece) and how long it lasted as the water level rose.
The Silhouette: To make this map easy to analyze, the authors turn it into a Silhouette. Imagine taking that complex map of islands and flattening it into a single, smooth curve (like a shadow). This curve tells you, "Here is where the big, important loops are, and here is where the tiny, noisy blips are."

3. The Goal: Measuring the "Shape Change"

The paper asks: Does the treatment change the shape of the outcome?

The Scenario: Imagine a group of molecules.
- Group A (Untreated): They look like a tangled ball of yarn with no loops.
- Group B (Treated): The medicine untangles them, creating a perfect ring (a loop).
The Result: The "Silhouette" for Group B will have a big spike where the ring exists. The "Silhouette" for Group A won't.
The Magic: The authors calculate the difference between these two curves. This difference is the Topological Causal Effect. It quantifies exactly how much the treatment changed the structure, not just the average size.

4. The Engine: The "Double-Robust" Estimator

In real life, data is messy. You might not know exactly who got the treatment or why (confounding factors).

The Analogy: Imagine trying to judge a race where some runners started at different times or had different shoes.
The Solution: The authors built a special calculator called a Doubly Robust Estimator.
- It's like having two safety nets.
- If your guess about who got the treatment is wrong, the calculator uses its knowledge of the results to fix it.
- If your guess about the results is wrong, it uses its knowledge of who got the treatment to fix it.
- You only need one of those guesses to be right for the final answer to be accurate. This makes the method incredibly reliable even with messy, real-world data.

5. Real-World Examples

The paper tested this on three cool scenarios:

CT Scans (Lungs): They looked at lung scans of COVID patients. The "shape" of the infection (how the white spots are connected) changed in a way that simple averages missed. Their method detected the structural difference between infected and healthy lungs.
Molecules (Drugs): They simulated a drug that changes a molecule's shape. The old math said "nothing changed." The new math said, "Look! A new loop appeared!"
Dynamical Systems: They tested it on simulated data where the "shape" of the data points shifted, proving the method works even when the data is purely synthetic.

Summary

Think of this paper as inventing a new kind of ruler.

Old Ruler: Measures length, weight, and temperature.
New Ruler (Topological): Measures holes, loops, and connections.

By combining this new ruler with a super-smart, double-safety-net calculator, the authors gave scientists a way to finally ask and answer the question: "Did this treatment change the fundamental shape of the problem?" This is a huge leap forward for fields like medicine, biology, and engineering where structure matters more than simple numbers.

1. Problem Statement

Standard causal inference methods rely on Euclidean summaries (e.g., means, variances) to estimate treatment effects. These methods often fail when outcomes reside in complex, non-Euclidean spaces (e.g., molecular conformations, brain connectivity networks, medical images, or dynamical systems) where the scientifically relevant information lies in the topological structure (e.g., number of connected components, loops, voids) rather than simple geometric shifts.

Existing Topological Data Analysis (TDA) techniques can describe these structures but lack a formal framework for causal inference. Specifically, there is no established method to:

Define a causal estimand directly in terms of topological summaries.
Provide nonparametric estimation and valid statistical inference for these topological effects in observational settings.

2. Methodology

The authors propose a framework for Topological Causal Inference that bridges TDA with semiparametric causal estimation.

A. Topological Tools

Persistent Homology: The method uses persistent homology to summarize how topological features (components, loops, voids) appear and disappear across different scales (filtration parameters).
Persistence Diagrams: Features are represented as birth-death pairs $(a, b)$ in a multiset called a persistence diagram ( $D$ ).
Power-Weighted Silhouette Functions: To make these diagrams amenable to functional analysis, the authors map them into a Hilbert space using silhouette functions.
- A silhouette $\phi(t; D)$ is a normalized weighted average of "tent functions" derived from the points in the diagram.
- Power Weighting: Points are weighted by $(b-a)^r$ , where $r > 0$ . This emphasizes persistent features (long-lived loops/components) while downweighting noise.
- This transforms the complex, non-Euclidean diagram into a functional curve $\phi(t)$ defined over a compact interval $T$ .

B. Causal Estimand: Topological Average Treatment Effect (TATE)

The target parameter is the Topological Average Treatment Effect (TATE), defined as the expected difference in silhouette functions between potential outcomes under treatment ( $A=1$ ) and control ( $A=0$ ):
$\psi_d(t) = E[\phi_{i,d}^1(t) - \phi_{i,d}^0(t)]$
where $d$ is the homology dimension (e.g., $d=1$ for loops). This function captures how treatment alters the topological structure across filtration scales.

C. Estimation Strategy

The authors develop a doubly robust, fully nonparametric estimator based on the Augmented Inverse Probability Weighting (AIPW) approach:

Nuisance Parameters:
- Propensity score: $\pi(x) = P(A=1|X=x)$ .
- Conditional silhouette regression: $\mu_a(t, x) = E[\phi_d(t) | X=x, A=a]$ .
Estimator Construction:
- They utilize an Augmented IPW (AIPW) estimator:
  $\hat{\psi}_{AIPW}(t) = \mathbb{P}_n \left[ \hat{\mu}_1(t, X) - \hat{\mu}_0(t, X) + \left( \frac{A}{\hat{\pi}(X)} - \frac{1-A}{1-\hat{\pi}(X)} \right) (\phi_d(t) - \hat{\mu}_A(t, X)) \right]$
- Double Robustness: The estimator remains consistent if either the propensity score model ( $\hat{\pi}$ ) OR the outcome regression model ( $\hat{\mu}$ ) is correctly specified.
- Sample Splitting: To avoid empirical process restrictions and allow for complex machine learning nuisance estimators (e.g., Random Forests, Neural Networks), the data is split into training and estimation sets (or cross-fitted).

D. Inference and Hypothesis Testing

Weak Convergence: The authors establish that the estimator $\sqrt{n}(\hat{\psi} - \psi)$ converges weakly to a mean-zero Gaussian process in the space of bounded functions $\ell^\infty(T)$ .
Stability Bounds: A key theoretical contribution is a new stability bound showing that the $L_\infty$ distance between two silhouette functions is bounded by the Wasserstein distance ( $W_1$ ) between their underlying persistence diagrams.
Hypothesis Test: They construct a formal test for the null hypothesis of no topological effect ( $H_0: \psi_d(t) = 0$ for all $t$ ). The test statistic is the supremum norm of the estimated effect, with critical values derived via a multiplier bootstrap.

3. Key Contributions

Novel Causal Estimand: First framework to define causal effects directly via topological summaries (silhouettes of persistence diagrams) rather than vectorized Euclidean approximations.
Doubly Robust Nonparametric Estimation: Development of an efficient AIPW estimator that achieves $\sqrt{n}$ -consistency and asymptotic normality under standard product-rate conditions on nuisance errors, even with complex, high-dimensional functional outcomes.
Theoretical Guarantees:
- Proof of weak convergence to a Gaussian process.
- Derivation of new stability bounds for power-weighted silhouettes under Wasserstein perturbations.
- Construction of a valid hypothesis test with asymptotically correct size and consistency.
Empirical Validation: Demonstration on semi-synthetic datasets (SARS-CoV-2 CT scans, GEOM-Drugs molecular graphs, ORBIT point clouds) showing that AIPW outperforms standard Plug-in (PI) and Inverse Probability Weighting (IPW) estimators, particularly in reducing bias and capturing complex curvature.

4. Results

Simulation Studies:
- In experiments involving CT scans (0-dimensional features) and molecular graphs (1-dimensional loops), the AIPW estimator accurately reconstructed the true topological causal effect.
- IPW tended to overestimate the effect, while PI tended to underestimate it or fail to capture complex shapes.
- AIPW remained robust even when one of the nuisance models (propensity or outcome) was severely misspecified, confirming double robustness.
Hypothesis Testing:
- On the ORBIT dataset, the test correctly rejected the null hypothesis for 1-dimensional homology (where treatment induced new loops) but failed to reject for 0-dimensional homology (where no structural change occurred), aligning with visual ground truth.
Computational Efficiency: While persistent homology is computationally intensive ( $O(N^3)$ generally), the authors demonstrated feasibility on datasets with thousands of samples (e.g., 2,481 CT scans, 30,433 molecular graphs) using optimized libraries (GUDHI).

5. Significance

This work significantly broadens the scope of causal inference by enabling rigorous analysis of structural effects in complex systems.

Domain Impact: It provides a statistical toolkit for fields where "shape" matters more than "size," such as:
- Biomedicine: Analyzing protein folding or CT-scan lesions.
- Neuroscience: Studying changes in brain connectivity networks.
- Material Science: Investigating molecular conformation changes.
Methodological Advancement: It moves beyond heuristic vectorization of topological data, offering a principled, nonparametric approach with valid inference guarantees. This allows researchers to detect intervention-induced changes that are invisible to traditional scalar summaries.