Unsupervised Representation Learning - an Invariant Risk Minimization Perspective

This paper proposes a novel unsupervised framework for Invariant Risk Minimization that redefines invariance through feature distribution alignment, introducing the linear PICA and deep generative VIAE methods to learn robust, environment-invariant representations from unlabeled data.

Yotam Norman, Ron Meir

Published 2026-03-05 · Author reviewed

The Big Picture: Teaching a Robot to See the "Real" Thing

Imagine you are trying to teach a robot to recognize cats.

  • Scenario A (The Old Way): You show the robot 1,000 photos of cats. In every photo, the cat is sitting on a green grassy lawn. The robot learns: "Cats = Green Grass + Fluffy Thing."
  • The Problem: When you show the robot a cat sitting on a red carpet inside a house, it panics. It says, "No cat here! The grass is missing!" It failed because it learned the background (the environment) instead of the subject (the invariant truth).

In the world of AI, this is called distribution shift. The "environment" (grass vs. carpet) changed, and the robot broke.

Invariant Risk Minimization (IRM) is a technique designed to fix this. It tries to teach the robot to ignore the background and focus only on the cat. However, traditionally, IRM needed labels (a human telling the robot, "Yes, that is a cat") to work.

This Paper's Big Idea:
The authors, Yotam Norman and Ron Meir, asked: "What if we don't have labels? What if we just have a pile of photos from different environments, and we don't know which is which?"

They created a new framework that allows AI to learn what is "real" (invariant) and what is just "noise" (environmental) without needing a teacher to grade its homework.


The Two New Tools

The paper introduces two methods to solve this puzzle, depending on how complex the data is.

1. PICA: The "Mathematical Filter" (For Simple Data)

The Analogy: Imagine you have two jars of mixed-up colored marbles.

  • Jar 1 (Environment A): Mostly red marbles, but a few blue ones.
  • Jar 2 (Environment B): Mostly blue marbles, but a few red ones.

You want to find the "true" pattern that exists in both jars, ignoring the fact that one jar is red-heavy and the other is blue-heavy.

PICA (Principal Invariant Component Analysis) is like a smart sieve. It looks at the math behind the marbles (specifically, how they vary). It calculates:

  1. What is different between Jar 1 and Jar 2? (The "environmental" noise).
  2. What is the same? (The "invariant" signal).

It then filters out the differences and keeps only the shared directions: a linear, mathematical way to strip away the "flavor" of the environment and find the core truth.
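To make the "smart sieve" concrete, here is a minimal numpy sketch of the idea, not the paper's exact algorithm: estimate the covariance of each environment, then keep the directions along which the two covariances agree (the small-eigenvalue directions of their difference). The helper name `pica_directions` and the eigenvalue-thresholding details are illustrative assumptions.

```python
import numpy as np

def pica_directions(x_env1, x_env2, n_keep=1):
    """Hypothetical PICA-like filter (illustrative, not the paper's method):
    keep directions where the second-order statistics of the two
    environments agree."""
    c1 = np.cov(x_env1, rowvar=False)
    c2 = np.cov(x_env2, rowvar=False)
    # Directions where the covariances differ most are "environmental";
    # directions where the difference is ~0 are "invariant".
    eigvals, eigvecs = np.linalg.eigh(c1 - c2)
    order = np.argsort(np.abs(eigvals))  # smallest |difference| first
    return eigvecs[:, order[:n_keep]]

# Toy "jars": dimension 0 is invariant (same spread in both jars),
# dimension 1 is environmental (spread differs across jars).
rng = np.random.default_rng(0)
env1 = rng.normal(0.0, [1.0, 3.0], size=(5000, 2))
env2 = rng.normal(0.0, [1.0, 0.5], size=(5000, 2))

w = pica_directions(env1, env2)
# w is approximately the invariant axis [1, 0]: the sieve ignores
# the red-heavy / blue-heavy dimension and keeps the shared one.
```

The key design choice is that "invariance" is defined by distribution alignment (matching statistics across environments), not by predicting a label, which is exactly what lets the method run without supervision.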

2. VIAE: The "Split-Brain Artist" (For Complex Data)

The Analogy: Imagine a master chef (the AI) who needs to cook a dish (generate an image) based on two ingredients:

  • Ingredient A (Invariant): The recipe (e.g., "It's a burger"). This must stay the same no matter where you are.
  • Ingredient B (Environmental): The local spices (e.g., "It's a burger in Texas" vs. "It's a burger in Tokyo"). This changes based on the location.

VIAE (Variational Invariant Autoencoder) is a deep learning model that acts like a chef with a split brain:

  • Brain 1 (The Invariant Encoder): Looks at the raw data and tries to extract only the recipe (the burger shape). It ignores the spices.
  • Brain 2 (The Environmental Encoders): There is one of these for every environment. They look at the data and extract only the spices (the Texas style vs. Tokyo style).
  • The Decoder (The Cook): Takes the Recipe + The Spices and reconstructs the image.

Why is this cool?
Because the "Recipe" part is separated from the "Spices" part, you can do magic tricks:

  • Style Transfer: You can take a photo of a "Texas Burger" (Input), strip out the Texas spices, and add "Tokyo spices" to it. The result is a "Tokyo Burger" that looks exactly like the original burger, just with a different vibe.
  • No Labels Needed: The AI figures out which part is the recipe and which is the spice just by looking at many different environments, without anyone telling it "This is a burger."
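The split-brain mechanics above can be sketched in a few lines. This is a purely structural toy with hand-set linear maps, not a trained variational model: the point is only how a split latent code makes style transfer a matter of swapping the environmental half.

```python
import numpy as np

# Structural sketch of a VIAE-style split latent space (illustrative only:
# the real model uses trained variational encoders and a deep decoder).
def encode_invariant(x):
    return x[:1]   # "recipe": the part shared across environments

def encode_environment(x):
    return x[1:]   # "spices": the environment-specific part

def decode(z_inv, z_env):
    return np.concatenate([z_inv, z_env])

texas_burger = np.array([7.0, 1.0])   # content code 7, Texas style 1
tokyo_burger = np.array([3.0, 9.0])   # content code 3, Tokyo style 9

# Style transfer: keep the Texas burger's recipe, swap in Tokyo's spices.
transferred = decode(encode_invariant(texas_burger),
                     encode_environment(tokyo_burger))
print(transferred)  # [7. 9.]: original content, new environment style
```

In the actual model the two encoders are neural networks trained jointly with the decoder, but the swap trick works for the same reason it does here: the decoder only ever sees (recipe, spices) pairs, so recombining them is well-defined.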

How They Tested It (The Experiments)

The team tested their ideas on three types of puzzles:

  1. Synthetic Data: Made-up math problems where they knew the answer. PICA worked perfectly, proving the math holds up.
  2. SMNIST & SCMNIST (Modified Digits):
    • They took handwritten numbers (0-9) and added fake "spurious" features.
    • Example: In Environment 1, all numbers had a white square in the top-left corner. In Environment 2, the square was in the bottom-right.
    • The Result: The AI learned to ignore the square's position and focus only on the number itself. It could recognize a "7" even if the square was in a new place it had never seen before.
  3. CelebA (Celebrity Faces):
    • They treated Gender (Male/Female) as the "Environment" and Facial Features (nose shape, smile, expression) as the "Invariant" truth.
    • The Result: The AI could take a photo of a man, strip out the "male" environmental features, and swap them for "female" features, resulting in a woman who still looked like the original man (same smile, same face shape). This matters for fairness: it shows the AI can separate identity from a sensitive attribute like gender.
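The SMNIST-style setup in experiment 2 is easy to reproduce in miniature. The sketch below pastes a white square into an environment-dependent corner of a digit image; the function name and square size are illustrative assumptions, not the paper's exact construction.

```python
import numpy as np

def add_spurious_square(digit, env, size=4):
    """Paste a white square whose position depends only on the environment
    (a sketch of an SMNIST-style spurious feature; details are illustrative)."""
    img = digit.copy()
    if env == 0:
        img[:size, :size] = 1.0      # environment 0: top-left corner
    else:
        img[-size:, -size:] = 1.0    # environment 1: bottom-right corner
    return img

digit = np.zeros((28, 28))           # stand-in for a 28x28 handwritten digit
e0 = add_spurious_square(digit, env=0)
e1 = add_spurious_square(digit, env=1)
# The digit content is identical in e0 and e1; only the environmental
# artifact moves. A robust model should encode the digit, not the square.
```

A model that latches onto the square's position would fail the moment the square moves, which is exactly the grass-versus-carpet failure from the introduction.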

Why Does This Matter?

  1. No Labels Required: Usually, to train AI to be robust, you need thousands of labeled examples. This method works with unlabeled data, which is much cheaper and easier to get.
  2. Robustness: It helps AI survive in the real world where conditions change (e.g., a self-driving car seeing rain instead of sun, or a medical scanner using a different machine).
  3. Fairness: By separating "sensitive" traits (like race or gender) from "relevant" traits (like qualifications or medical symptoms), we can build AI that makes fairer decisions.

The Bottom Line

This paper is like giving AI a pair of X-ray glasses. Instead of seeing the surface details that change from place to place (the environment), the AI learns to see the underlying skeleton that stays the same (the invariant truth). It does this without needing a human to point and say, "That's the skeleton!" It figures it out on its own.

This opens the door to smarter, fairer, and more adaptable AI that can handle the messy, changing real world without breaking a sweat.
