Spatio-Temporal Forecasting of Retaining Wall Deformation: Mitigating Error Accumulation via Multi-Resolution ConvLSTM Stacking Ensemble

This study introduces a multi-resolution ConvLSTM stacking ensemble framework that effectively mitigates error accumulation and enhances the accuracy of long-horizon retaining wall deformation forecasting by integrating models trained on diverse temporal input resolutions.

Jihoon Kim (Department of Civil,Environmental Engineering, Hongik University, Seoul, Republic of Korea), Heejung Youn (Department of Civil,Environmental Engineering, Hongik University, Seoul, Republic of Korea)

Published 2026-03-12
📖 5 min read🧠 Deep dive

Here is an explanation of the paper, translated into simple language with creative analogies.

🏗️ The Big Picture: Predicting the "Wiggle" of a Construction Wall

Imagine you are digging a giant hole in the ground to build a skyscraper's basement. To keep the dirt from collapsing into your hole, you build a giant wall around the perimeter. As you dig deeper, that wall starts to bend or "wiggle" inward.

If you can't predict exactly how much it will wiggle, the wall might crack, or the buildings next door might sink. This is a huge safety risk.

The Problem:
Engineers have two main ways to guess how the wall will behave:

  1. Math Models: Like a complex physics simulation. They are accurate but take forever to run and are hard to tweak.
  2. AI (Artificial Intelligence): Like a super-smart student who learns from past examples. It's fast, but it has a major flaw: The "Whisper Game" effect.

If you ask an AI to predict what happens 10 steps into the future, it usually guesses step 1, then uses that guess to guess step 2, then uses that guess for step 3, and so on. If it makes a tiny mistake on step 1, that mistake gets bigger on step 2, and by step 10, the prediction is completely wrong. This is called error accumulation.

🧩 The Solution: The "Three-Headed Oracle"

The authors of this paper created a new AI system to fix the "Whisper Game" problem. Instead of relying on one AI, they built a team of three AIs that work together, and then a fourth "Manager AI" that decides who to listen to.

Here is how they did it:

1. The Three Specialists (The Base Models)

Imagine you are trying to predict the weather for the next two weeks.

  • Specialist A (The Short-Term Watcher): Only looks at the last 3 days of weather. They are great at spotting sudden storms but might miss the big seasonal trends.
  • Specialist B (The Medium-Term Watcher): Looks at the last 6 days. A good balance.
  • Specialist C (The Long-Term Historian): Looks at the last 10 days. They see the big picture but might be slow to react to sudden changes.

In the paper, these "specialists" are ConvLSTM models. They are a type of AI designed to understand both space (where the wall is bending) and time (how the bending changes as you dig).

2. The Manager (The Stacking Ensemble)

This is the secret sauce. The researchers didn't just average the three specialists' answers. They built a Manager AI (a deep neural network).

Think of the Manager like a conductor in an orchestra.

  • When the weather is stable, the Manager listens mostly to the Long-Term Historian (Specialist C).
  • When the weather suddenly changes (like a storm hitting), the Manager switches focus to the Short-Term Watcher (Specialist A) because they are reacting faster.
  • The Manager learns to combine the "wisdom" of all three to create one perfect prediction.

🎓 How They Trained the AI

You can't train an AI on real construction sites easily because accidents happen, and you don't have enough data. So, the researchers used a Video Game Simulator (called PLAXIS2D).

  • They created 2,000 different "virtual construction sites."
  • They changed the soil type, the depth of the hole, and the strength of the wall randomly in every simulation.
  • This gave the AI a massive library of "what-if" scenarios to learn from, so it wouldn't be surprised by real-world chaos.

🏆 The Results: Who Won the Race?

They tested the system in two ways:

  1. On the Simulator: They checked if the AI could predict the next 10 steps of digging.
  2. On Real Life: They tested it on two actual construction sites in South Korea.

The Verdict:

  • The Single AIs (The Specialists): When asked to predict far into the future, they started to drift. Their predictions became wild guesses, often overestimating or underestimating the wall's movement.
  • The Team (The Ensemble): The Manager AI kept the predictions on track. Even when predicting 10 steps ahead (about 5 weeks of construction), the team was still 90% accurate. The single AIs dropped below 65% accuracy.

💡 The "Aha!" Moment (Why it works)

The researchers used a special tool called SHAP (which is like an X-ray for AI decisions) to see what the Manager was thinking.

They found something fascinating:

  • Early in the prediction: The Manager trusted the Long-Term Historian (Specialist C) the most because the trend was clear.
  • Later in the prediction: As the prediction got further away, the Manager started trusting the Short-Term Watcher (Specialist A) more.

Why? Because in the real world, things change unexpectedly. The Long-Term Historian gets "stuck" in the past and misses sudden shifts. The Short-Term Watcher is sensitive to the now. The Manager learned to switch its trust dynamically, using the right tool for the right moment.

🚀 The Takeaway

This paper proves that a team of diverse AI models is better than a single "super" model.

By combining models that look at different time windows (short vs. long term) and letting a smart Manager decide how much to trust each one, engineers can now predict how deep excavation walls will move weeks in advance with high accuracy. This means safer construction sites, less damage to neighboring buildings, and fewer surprises for the construction crew.

In short: Don't put all your eggs in one basket. Use a team of experts with different perspectives, and hire a smart manager to keep them in sync.