Algebraic Structure Discovery for Real World Combinatorial Optimisation Problems: A General Framework from Abstract Algebra to Quotient Space Learning

This paper proposes a general framework that leverages abstract algebra to identify and formalize hidden structures in combinatorial optimization problems, enabling the construction of quotient spaces that collapse redundant representations and significantly improve the efficiency and success rate of finding global optima compared to standard approaches.

Min Sun (F. Hoffmann-La Roche AG, Roche Pharma Research and Early Development), Federica Storti (F. Hoffmann-La Roche AG, Roche Pharma Research and Early Development), Valentina Martino (F. Hoffmann-La Roche AG, Roche Pharma Research and Early Development), Miguel Gonzalez-Andrades (F. Hoffmann-La Roche AG, Roche Pharma Research and Early Development), Tony Kam-Thong (F. Hoffmann-La Roche AG, Roche Pharma Research and Early Development)

Published 2026-04-08
📖 5 min read🧠 Deep dive

Imagine you are trying to find the perfect recipe for a cake. You have a massive pantry with thousands of ingredients (sugar, flour, eggs, spices, etc.). Your goal is to mix them together to create a cake that tastes the absolute best.

The problem? There are trillions of possible combinations. If you tried to bake every single one to see which is best, you'd be in the kitchen for a million years. This is what computer scientists call a "combinatorial optimization problem."

Most people try to solve this by baking randomly, or by following a strict checklist. But this paper proposes a clever new way: Stop looking at the ingredients as a list, and start looking at them as a mathematical system.

Here is the paper's idea, broken down into simple concepts and analogies.

1. The "Super Mario" Connection

The authors noticed that finding the best group of patients (or molecules) is surprisingly similar to playing Super Mario.

  • In Super Mario: You have basic moves: Jump, Run Left, Run Right, Go Down. You combine them to reach a goal. Interestingly, "Jump then Run" might get you to the same spot as "Run then Jump" in some contexts, or they might be functionally the same even if the button sequence looks different.
  • In Medicine/Science: You have basic rules like "Age > 65," "Blood Pressure < 120," or "Has Diabetes." You combine them with "AND" logic to find a specific group of people.

The paper argues that just like Mario's moves follow hidden mathematical laws, these medical rules follow hidden algebraic laws.

2. The "Duplicate Folder" Problem

Imagine you are organizing a massive library of books.

  • Rule A: "Books about Cats."
  • Rule B: "Books about Cats AND Books about Cats."

Mathematically, Rule A and Rule B are identical. They pick out the exact same books. But a computer looking at them as text strings sees them as different. It wastes time checking both.

The authors discovered that many different-looking rules actually produce the exact same result.

  • Rule 1: "Age > 50 AND Smoker = Yes"
  • Rule 2: "Smoker = Yes AND Age > 50"

These are the same. The paper calls these Equivalence Classes. Think of them as "Duplicate Folders."

3. The Magic Trick: The Quotient Space

This is the core of the paper. Instead of searching through the entire messy library (the huge search space), the authors suggest building a Quotient Space.

The Analogy:
Imagine you have 1,000 folders, but 900 of them are just duplicates of 100 unique folders.

  • The Old Way: You open every single one of the 1,000 folders to find the best document.
  • The New Way (Quotient Space): You realize the duplicates don't matter. You create a "Smart Filing System" that automatically groups all duplicates together and gives you one representative from each group. Now, instead of opening 1,000 folders, you only open 100.

This "Smart Filing System" is the Quotient Space. It collapses all the redundant, duplicate rules into single, unique representatives.

4. How They Did It (The "Binary Switch" Trick)

To make this work on a computer, they translated the rules into a language computers love: Binary Code (0s and 1s).

  • Imagine a row of light switches.
  • Switch 1 = "Age > 50" (On = 1, Off = 0)
  • Switch 2 = "Smoker" (On = 1, Off = 0)
  • Switch 3 = "Height > 6ft" (On = 1, Off = 0)

When you combine rules with "AND," the computer treats it like flipping switches. The authors proved that this system behaves exactly like a Boolean Hypercube (a multi-dimensional cube of switches).

Because they know the math behind these switches, they can use a special type of algorithm (a "Structure-Aware Genetic Algorithm") that knows: "Hey, I don't need to check every single combination. I just need to check one from each group of duplicates."

5. The Results: Why It Matters

The team tested this on real medical data (finding specific groups of patients with dry eye disease) and synthetic data.

  • The Standard Approach: The computer tried random combinations. It found the "perfect" solution only 35% to 37% of the time.
  • The New Approach (Quotient Space): The computer used the "Smart Filing System." It found the "perfect" solution 48% to 77% of the time.

The Takeaway:
By realizing that many rules are just "duplicates" in disguise, and by organizing the search space to ignore those duplicates, they made the computer smarter and faster.

Summary in One Sentence

This paper teaches us that many complex problems (like finding the best patient group or the best drug) have hidden mathematical patterns; if we organize our search to ignore "duplicate" solutions and focus only on unique ones, we can find the best answer much faster and more reliably.

It's like realizing you don't need to taste every single spoonful of soup to find the saltiest one; you just need to taste one spoonful from every distinct bowl, because some bowls are just copies of others.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →