GALACTIC: Global and Local Agnostic Counterfactuals for Time-series Clustering

This paper introduces GALACTIC, a unified framework that bridges local and global counterfactual explainability for unsupervised time-series clustering by generating minimal perturbations to cross cluster boundaries and employing a provably efficient submodular optimization algorithm to derive concise, non-redundant global summaries of these transitions.

Christos Fragkathoulas, Eleni Psaroudaki, Themis Palpanas, Evaggelia Pitoura

Published 2026-03-06
📖 4 min read☕ Coffee break read

Imagine you have a massive library of thousands of different songs (time-series data). You want to organize them into genres like "Jazz," "Rock," and "Classical" without knowing the labels beforehand. You use a smart computer program to sort them. It works great, but there's a problem: nobody knows why a specific song ended up in the "Jazz" pile instead of the "Rock" pile.

The computer just says, "It's Jazz." But a human wants to know: "What if I changed the drum beat? Would it become Rock?" or "What is the one thing that makes this group of songs feel like Jazz?"

This paper introduces GALACTIC, a new tool that acts like a time-traveling music critic to answer those questions. It explains time-series data (like heartbeats, stock prices, or robot movements) in two ways: for individual items and for the whole group.

Here is how it works, using simple analogies:

1. The Problem: The "Black Box" Clustering

Think of your data as a giant bag of mixed-up marbles. You dump them into a machine that sorts them into red, blue, and green piles.

  • The Issue: You can see the piles, but you don't know the machine's secret rule. Why did this specific marble go into the red pile?
  • The Old Way: Previous tools tried to explain this by pointing at the whole marble and saying, "It's red because it's shiny." But that's vague. You need to know exactly which part of the marble to change to make it blue.

2. The Local Solution: The "What-If" Mirror

GALACTIC starts by looking at one single marble (one time-series instance).

  • The Goal: It asks, "What is the smallest, easiest change I can make to this marble to move it to the Blue pile?"
  • The Magic Trick (Structural Awareness): Imagine the marble is actually a long, flexible snake made of different colored segments.
    • Old tools would just randomly paint parts of the snake blue, even the parts that don't matter, making the snake look weird and unrealistic.
    • GALACTIC is smarter. It knows that for a "Jazz" song, the drums matter, but the background noise doesn't. It focuses its changes only on the critical parts (the drums). It leaves the rest of the snake alone.
    • Result: It gives you a "Counterfactual": "If you just changed the drum beat at second 10, this song would be Rock." It's a minimal, realistic edit.

3. The Global Solution: The "Summary Book"

Now, imagine you have 1,000 "Jazz" marbles. If you ask GALACTIC to explain all 1,000 of them individually, you get 1,000 different answers. That's too much information! It's like reading 1,000 different travel guides for the same city.

  • The Goal: GALACTIC needs to summarize the entire group. It wants to find the top 3 or 4 rules that explain why most Jazz songs are Jazz.
  • The Magic Trick (The MDL Principle): This is where the paper gets fancy with a concept called Minimum Description Length (MDL).
    • Think of MDL as a compression algorithm for explanations.
    • GALACTIC tries to find the shortest "story" that explains the most marbles.
    • If it finds a rule that explains 90% of the marbles with just one sentence, it picks that. If it needs 100 sentences to explain the last 10%, it might decide that's too much effort and stop.
    • It uses a mathematical guarantee (Submodularity) to ensure it finds the best summary without checking every single possibility (which would take forever).

4. Why is this better than the competition?

  • Old Tools: They were like a blunt hammer. They would smash the whole marble to change its color, or they would give you a summary that was too long and confusing.
  • GALACTIC: It's like a scalpel.
    • Locally: It cuts only the specific thread that holds the song in the "Jazz" category, leaving the rest of the song intact.
    • Globally: It writes a concise "Cliff's Notes" version of the genre, telling you the essential features that define the group, ignoring the noise.

The Big Picture

GALACTIC bridges the gap between "I know the computer sorted this" and "I understand why and how to change it."

  • For a Doctor: "Your heart rate looks like 'Group A'. If you slow down the rhythm only during the night, it would look like 'Group B' (which is healthier)."
  • For a Banker: "This stock pattern is 'Group A' (Risky). If you smooth out the spikes only on Tuesday mornings, it becomes 'Group B' (Safe)."

In short, GALACTIC turns a mysterious black box into a transparent, interactive guide that tells you exactly what to tweak to change the outcome, without breaking the reality of the data.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →