Global Interpretability via Automated Preprocessing: A Framework Inspired by Psychiatric Questionnaires

⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you are trying to predict how a patient's mood will change over the next year based on a questionnaire they fill out today. The problem is that these questionnaires are messy. A patient might have a bad day, a rater might be tired, or the questions might be interpreted differently depending on who is asking. This "noise" makes it hard to see the true pattern.

Traditionally, doctors and data scientists have faced a tough choice:

Use a simple, clear model: Easy to understand, but often inaccurate because it can't handle the messy, complex reality of human emotions.
Use a complex, "black box" AI: Very accurate at predicting the future, but impossible to explain. If a doctor can't understand why the AI made a prediction, they won't trust it with a patient's life.

This paper introduces a clever new method called REFINE that solves this dilemma. It's like hiring a professional editor before you write your story.

The Core Idea: The "Editor" and the "Translator"

Think of the process of predicting a patient's future symptoms as a two-step journey. REFINE splits this journey into two distinct roles:

1. The Editor (The Preprocessing Step)

Imagine you have a rough draft of a story written by a nervous teenager. It's full of typos, rambling sentences, and emotional outbursts that don't fit the plot. You need a professional Editor to clean it up.

What REFINE's "Editor" does: It looks at the messy questionnaire data and uses a powerful, flexible AI (like a smart neural network) to "denoise" it. It figures out what is just a temporary glitch (like a bad day) and what is the stable, true signal (the actual symptoms).
The Magic: This Editor is allowed to be as complex and "black box" as it wants. Its only job is to clean the data. It doesn't have to explain how it cleaned it; it just has to make the data clean and stable.

2. The Translator (The Prediction Step)

Once the Editor has produced a clean, polished version of the story, you hand it to a Translator.

What REFINE's "Translator" does: This translator is very simple. It only speaks "Linear Math." It looks at the clean data and draws a straight line to predict the future.
The Benefit: Because the data is already clean, this simple translator can be 100% accurate. And because it's a simple linear model, a doctor can look at it and say, "Ah, I see. If symptom A goes up by 1 point, symptom B is expected to go down by 2 points." It is globally interpretable—the rules are the same for every single patient.

Why This is a Game-Changer

Most other methods try to make the whole system simple (which loses accuracy) or make the whole system complex and then try to guess what it's thinking afterwards (which is confusing).

REFINE says: "Let the complex part do the dirty work of cleaning, and let the simple part do the explaining."

The "Psychiatric Questionnaire" Problem

In fields like MRI scans or DNA sequencing, scientists already do this. They have special tools to clean up the raw images or DNA strands before analyzing them. But psychiatric questionnaires are different; they don't have "pixels" or "genes" to clean. They are just words and numbers.

The authors realized that even though questionnaires are messy, they have a secret weapon: Redundancy.

If a patient reports "sadness" today, and they report "sadness" again in two weeks, that's a real signal.
If they report "sadness" today but "happiness" in two weeks, the "sadness" today might have been a fluke or a measurement error.

REFINE uses this redundancy. It learns to "stabilize" the answers. It asks, "What part of this answer today is likely to stay the same in two weeks?" It strips away the fluke and keeps the truth.

A Real-World Analogy: The Noisy Radio

Imagine you are trying to predict the weather based on a radio broadcast that is full of static (noise).

The Old Way: You try to guess the weather while listening to the static. You might get it right sometimes, but you can't explain your logic because the static is confusing.
The REFINE Way:
1. Step 1 (The Filter): You run the radio signal through a high-tech noise-canceling filter. This filter is complex and uses advanced math to remove the static. You don't need to understand how the filter works; you just know the output is clear.
2. Step 2 (The Forecast): Now you have a crystal-clear signal. You use a simple, transparent rule (e.g., "If the temperature is 70°F, it will rain") to predict the weather. Because the signal is clear, your simple rule works perfectly, and anyone can understand it.

The Results

The authors tested REFINE on real data from patients with depression and psychosis.

Accuracy: It predicted future symptoms just as well as the complex "black box" models.
Trust: Unlike the black boxes, doctors could look at REFINE's results and understand exactly which symptoms were driving the prediction.
Speed: It was surprisingly fast, taking only seconds to run.

The Bottom Line

REFINE is a framework that says: "Don't try to make the whole AI simple. Instead, use a smart AI to clean the data, and then use a simple, transparent rule to make the prediction."

It's like hiring a master chef (the complex AI) to prep the ingredients perfectly, so that a home cook (the simple linear model) can follow a clear recipe to make a perfect meal. The result is a dish that tastes great (accurate) and is easy to understand (interpretable).

1. Problem Statement

The paper addresses a critical challenge in clinical machine learning, specifically regarding psychiatric questionnaires (e.g., HAM-D, PANSS):

The Prediction Challenge: Psychiatric symptoms are noisy, context-sensitive, and follow complex, nonlinear trajectories over time. Accurate forecasting of future symptom severity often requires flexible nonlinear models.
The Interpretability Challenge: While nonlinear models (like neural networks or gradient boosting) improve accuracy, they lack global interpretability. Clinicians cannot easily trust "black box" models, and standard post-hoc explanation tools (like SHAP) provide only local (patient-specific) attributions. These local explanations often vary wildly across patients and symptom dimensions, making it difficult to extract a coherent, global understanding of prognostic factors.
The Gap: Existing methods either sacrifice accuracy for interpretability (using simple linear models) or sacrifice global interpretability for accuracy (using complex nonlinear models with local explanations). There is a lack of methods that achieve high predictive accuracy while maintaining a transparent, global linear structure for the prognostic relationship.

2. Methodology: The REFINE Framework

The authors propose REFINE (Redundancy-Exploiting Follow-up-Informed Nonlinear Enhancement), a two-stage framework that decouples preprocessing from prediction.

Core Philosophy

Inspired by preprocessing in imaging and omics (where artifacts are removed to stabilize signals before fitting linear models), REFINE concentrates all nonlinearity into an automated preprocessing step. The prognostic mapping from the preprocessed data to future outcomes remains exactly linear and globally interpretable.

The Two-Stage Process

Nonlinear Preprocessing (Stabilization):
- Goal: To learn a transformation $h_t(X_0, Z)$ that converts baseline questionnaire items ( $X_0$ ) and covariates ( $Z$ ) into "stabilized" item values.
- Mechanism: The model uses follow-up measurements ( $X_t$ ) as "privileged information" during training. It first learns a linear reconstruction matrix $B_t$ that maps follow-up items back to baseline items ( $X_0 \approx X_t B_t$ ).
- Target: The target for the nonlinear learner is the proxy $\tilde{X}^{(t)}_0 = X_t B_t$ . This proxy represents the component of the baseline items that is reproducible and predictable from future assessments (longitudinal redundancy).
- Learning: A flexible nonlinear learner (e.g., Random Forests) is trained to predict this proxy $\tilde{X}^{(t)}_0$ from the raw baseline inputs $(X_0, Z)$ . This yields the preprocessing map $h_t$ .
Linear Prediction (Decoding):
- Goal: To predict future symptoms $X_t$ from the stabilized representation.
- Mechanism: The model applies a linear decoder $\beta_t$ to the output of the preprocessor.
- Derivation: The decoder is defined as the inverse of the reconstruction matrix: $\beta_t = B_t^{-1}$ .
- Result: The final predictor is $X_t = h_t(X_0, Z) \beta_t$ . Because $h_t$ is learned to recover the conditional mean of the proxy, and $\beta_t$ inverts the proxy back to the target, the entire system is Bayes-optimal for predicting the conditional mean $E[X_t | X_0, Z]$ .

Key Constraints

The framework enforces two minimal criteria for the preprocessor:

Longitudinal Redundancy: The preprocessor must extract signal that is reproducible over time, suppressing visit-specific noise (e.g., rater bias, context shifts).
Item-Level Meaning Preservation: The output of the preprocessor must remain aligned with the original item coordinates. Coordinate $j$ of the output must correspond to a stabilized version of item $j$ , ensuring that the linear coefficients $\beta_t$ can be interpreted directly in terms of specific questionnaire items.

3. Key Contributions

Formalization of Preprocessing Criteria: The authors define two principled requirements for clinically interpretable preprocessing: preserving longitudinal redundancy and maintaining item-wise alignment.
The REFINE Algorithm: A novel method that learns a nonlinear preprocessor using follow-up data as supervision, then applies a linear decoder derived via matrix inversion.
Theoretical Guarantees:
- Uniqueness: Under the constraint of item alignment, the Bayes-optimal "preprocess-then-linear-predict" pipeline is unique and coincides with REFINE.
- Optimality: The method achieves Bayes-optimal predictive performance.
- Convergence: The end-to-end error converges at the rate determined by the nonlinear learner used for preprocessing, while the linear decoder retains standard parametric ( $n^{-1/2}$ ) convergence rates.
Global Interpretability: Unlike other methods that rely on aggregating local attributions, REFINE provides a single, unified coefficient matrix ( $\beta_t$ ) that describes the global prognostic relationship across all patients and symptoms.

4. Empirical Results

The authors evaluated REFINE on three datasets:

NAPLS-3 (Psychosis): Predicting prodromal psychosis symptoms (SOPS) over 24 months.
STAR*D (Depression): Predicting depressive symptoms (QIDS-SR) during citalopram treatment.
Adolescent Health (Non-psychiatric): Predicting anthropometric and physiological measures.

Comparators:

AICNN: A neural network with reconstruction loss (similar concept but different optimization).
GPBoost & XGBoost: Nonlinear models with TreeSHAP for local attribution.
MGCV: Generalized Additive Models.
Ablations: Linear-only versions of REFINE.

Findings:

Predictive Accuracy: REFINE achieved the highest or near-highest forward correlation (prediction accuracy) across all datasets and time points, outperforming standard interpretable baselines.
Longitudinal Stability: REFINE achieved the highest backward correlation, meaning the stabilized representations it learned were most effective at reconstructing baseline signals from future data.
Interpretability (Item Alignment): REFINE showed the highest cosine similarity between the contribution matrix and the diagonal. This indicates that baseline item $j$ primarily influences follow-up item $j$ , preserving the semantic meaning of the questionnaire items better than other methods.
Efficiency: REFINE was the fastest method in the comparison (seconds to run), significantly faster than deep learning or complex boosting approaches.

5. Significance and Impact

Bridging the Gap: REFINE successfully bridges the gap between the need for flexible nonlinear modeling (to handle complex symptom trajectories) and the clinical need for global interpretability.
New Paradigm for Clinical AI: It shifts the paradigm from "post-hoc explanation" to "pre-hoc stabilization." By moving nonlinearity into a preprocessing step, the prognostic model itself remains transparent and linear.
Generalizability: While motivated by psychiatry, the framework is applicable to any domain involving repeated measurements where variables evolve over time (e.g., endocrinology, gastroenterology), provided the goal is to predict the evolution of a vector of correlated variables.
Clinical Trust: By providing a single, stable coefficient matrix that applies to all patients, REFINE offers a level of transparency that facilitates clinical trust and decision-making, which is often eroded by patient-specific, fluctuating explanations from black-box models.

In summary, the paper demonstrates that explicitly learning a task-aligned preprocessing operator allows for the construction of models that are both Bayes-optimal in prediction and globally interpretable in their prognostic relationships, without requiring heuristic aggregation of local explanations.