EARCP: Self-Regulating Coherence-Aware Ensemble Architecture for Sequential Decision Making -- Ensemble Auto-Regule par Coherence et Performance

This paper introduces EARCP, a novel self-regulating ensemble architecture that dynamically weights heterogeneous expert models by balancing individual performance and inter-model coherence through an online learning mechanism, offering theoretical regret guarantees and demonstrating robustness in non-stationary sequential decision-making tasks.

Mike Amega

Published 2026-03-17
📖 4 min read☕ Coffee break read

Imagine you are the captain of a ship navigating through a stormy, ever-changing ocean. You have a crew of four different navigators, each with their own unique style:

  1. The Historian: Great at reading old maps and spotting patterns from the past.
  2. The Meteorologist: Excellent at reading the wind and clouds right now.
  3. The Astronomer: Good at looking far ahead at the stars for long-term direction.
  4. The Mechanic: Knows exactly how the ship's engine is behaving and can predict mechanical failures.

The Problem with Traditional Teams

In most traditional "ensemble" teams (groups of experts), the captain picks a fixed rule: "I will listen to the Historian 25%, the Meteorologist 25%, and so on."

This works fine if the weather stays the same. But what if the Historian's old maps become useless because the coastline has changed? Or what if the Meteorologist is great at rain but terrible at fog? A fixed rule fails because it can't adapt when the world changes.

The Solution: EARCP (The "Smart Captain")

The paper introduces EARCP, a new way to run this team. Instead of a fixed rule, EARCP is a self-regulating, smart captain that constantly adjusts how much it listens to each navigator based on two things:

  1. How well they are doing right now (Performance).
  2. Whether they agree with each other (Coherence).

Here is how it works in simple terms:

1. The Scorecard (Performance)

Every time the ship makes a turn, the captain checks: "Did the Historian's prediction match where we actually ended up?"

  • If the Historian was right, their score goes up.
  • If they were wrong, their score goes down.
  • This is like a teacher grading a student on a test.

2. The Group Chat (Coherence)

This is the secret sauce. EARCP doesn't just look at who is right; it looks at who agrees with whom.

  • Imagine the Historian says, "Turn left!" and the Meteorologist says, "Turn left!" but the Astronomer says, "Turn right!"
  • EARCP sees that two experts agree. It thinks, "Okay, even if I'm not 100% sure who is right, the fact that two of them agree gives me confidence."
  • If everyone is shouting different things, the captain knows it's a "foggy" situation and becomes more cautious, perhaps listening to everyone a little bit more equally to avoid disaster.

3. The "Floor" Rule (Exploration)

Sometimes, a navigator might be having a bad day and get a low score. In a normal system, the captain might stop listening to them entirely.

  • EARCP has a safety rule: "No one gets fired completely."
  • It ensures every navigator gets at least a tiny bit of attention (a "floor"). Why? Because the Historian might be bad today but could be the only one who knows how to navigate a sudden storm tomorrow. This keeps the team ready for surprises.

Why is this better than the old ways?

  • Old Way (Static): Like a robot that follows a script. If the script says "Trust the Historian," it trusts them even when the Historian is wrong.
  • Old Way (Online Learning): Like a student who only cares about their own test score. They might ignore the fact that the whole class is confused, leading to bad decisions.
  • EARCP: Like a wise leader who says, "You did well yesterday, but today you're struggling. Also, you and the Meteorologist agree, so I'll trust you a bit more. But I won't ignore the Astronomer completely, just in case."

Where can we use this?

The paper suggests this "Smart Captain" logic works anywhere the world changes quickly and we need to make decisions:

  • Stock Markets: Instead of just trusting one trading algorithm, EARCP balances them. If the "Tech Stock" algorithm starts failing but agrees with the "Crypto" algorithm, it might adjust weights to handle the volatility.
  • Medical Diagnosis: Imagine a system with an AI for X-rays, an AI for MRIs, and an AI for blood tests. If the X-ray AI is confused but the MRI and Blood Test AIs agree on a diagnosis, EARCP leans on that consensus.
  • Self-Driving Cars: If the camera sensor is blinded by rain but the radar and GPS agree on the path, the car trusts the agreement rather than panicking.

The Bottom Line

EARCP is a framework that builds a team of AI experts that learns how to work together in real-time. It doesn't just ask, "Who is the smartest?" It asks, "Who is doing well right now, and who is agreeing with the group?"

By balancing individual performance with group agreement, and ensuring no expert is ever completely ignored, it creates a decision-making system that is robust, adaptable, and much harder to fool than a single model or a rigid team.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →