Probing forced responses and causality in data-driven climate emulators: conceptual limitations and the role of reduced-order models

This paper argues that data-driven climate emulators often fail to capture causal forced responses due to inadequate coarse-grained representations and parameterizations, advocating instead for tailored reduced-order stochastic models guided by linear response theory to better resolve multiscale dynamics and enable causal studies.

Original authors: Fabrizio Falasca

Published 2026-01-27
📖 5 min read🧠 Deep dive

Original authors: Fabrizio Falasca

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine the Earth's climate as a giant, chaotic orchestra. It has thousands of instruments playing at once, from the deep, slow rumble of ocean currents to the rapid, high-pitched chirping of daily weather. For decades, scientists have tried to build a "digital twin" of this orchestra using artificial intelligence (AI) to predict how it will sound in the future.

This paper, written by Fabrizio Falasca, asks a critical question: Just because an AI can perfectly mimic the orchestra's current sound, does it actually understand how the music will change if we suddenly change the conductor's tempo?

Here is a breakdown of the paper's findings using simple analogies.

1. The Problem: The "Perfect Mimic" vs. The "True Understanding"

Current AI climate models are like incredibly talented parrots. If you play them a recording of the climate, they can repeat the sounds (the statistics) almost perfectly. They can tell you what the average temperature is or how much rain usually falls.

However, the paper argues that these "parrots" often fail when you ask them a "what if" question. If you tell the AI, "What happens if the ocean gets warmer in a specific pattern?" the AI might guess the wrong answer. It mimics the past but doesn't understand the causes. In scientific terms, it captures "stationary statistics" (the average state) but fails at "forced responses" (how the system reacts to change).

2. The Test: The Three-String Instrument

To prove this, the authors didn't start with the massive, complex Earth. Instead, they built a tiny, simplified "instrument" with just three strings (variables) that mimics the physics of the real climate.

  • The Setup: They let this instrument play for a very long time so the AI could learn its song.
  • The Test: They then gave the instrument a tiny "tap" (a perturbation) and asked the AI to predict how the sound would change.

The Results:

  • The Linear Model (The Simple AI): This model was like a basic metronome. It could predict the average rhythm well, but if you tapped the instrument, it couldn't predict how the loudness (variance) would change. It was too rigid.
  • The Neural Model (The Smart AI): This model was much better. It could predict both the rhythm and the changes in loudness. It learned the "rules" of the instrument well enough to handle the tap.

The Catch: This success only happened because the AI had access to all three strings. It saw the whole instrument.

3. The Real-World Problem: The "Blind" Musician

In the real world, we are like blind musicians. We cannot see the entire climate system. We only see a few "strings" (like surface temperature) while the rest of the orchestra (deep ocean currents, tiny atmospheric swirls) is hidden from us.

The paper shows that when the AI only sees one string:

  • It can still learn to mimic the sound of that one string.
  • But, it often fails to predict how that string will react to a tap.

Why? Because the hidden strings are pushing and pulling the one we can see. If the AI doesn't know those hidden strings exist, it tries to explain the movement using only the visible string, leading to wrong predictions about cause and effect.

To fix this, the authors suggest two things:

  1. Choose the right string: You must pick the "slow" string (the one that matters most) rather than a fast, noisy one.
  2. Add "Ghost Noise": Since the AI can't see the hidden strings, it needs to be told that "invisible forces" are pushing the system. The authors found that adding a specific type of "noise" (randomness that changes based on the current state) helped the AI understand the hidden forces much better.

4. The Real-World Application: The "Pattern Effect"

The authors took these lessons and applied them to a real climate mystery called the "Pattern Effect."

  • The Mystery: The Earth's energy balance doesn't just depend on how much the ocean warms, but where it warms. Warming the Eastern Pacific might make the Earth hotter, while warming the Western Pacific might cool it down.
  • The Experiment: They built a specialized, simplified AI model that only looked at the "main patterns" of ocean temperature and the energy leaving the Earth (radiative flux).
  • The Success: By focusing on the big picture (coarse-graining) and adding the right "ghost noise," their AI successfully recreated the complex physics. It could predict how the Earth's energy balance would change if the ocean warmed in specific patterns. It even produced a map showing exactly where warming causes heating and where it causes cooling, matching what complex physics models say.

5. The Big Takeaway

The paper concludes that we shouldn't just build "general-purpose" AI that tries to learn everything about the climate at once. That approach is like trying to learn a symphony by listening to every single instrument simultaneously without a conductor's score—it's too messy.

Instead, we should build specialized, simplified models (Reduced-Order Models) that:

  1. Focus on the specific question we want to answer.
  2. Use "coarse-graining" to ignore the tiny, fast details and focus on the big, slow patterns.
  3. Use "stochastic" (random) elements to account for the invisible parts of the system we can't see.

By doing this, and by testing these models not just on how well they mimic the past, but on how well they predict the future when "tapped," we can build climate tools that truly understand cause and effect.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →