SLER-IR: Spherical Layer-wise Expert Routing for All-in-One Image Restoration

The paper proposes SLER-IR, a novel all-in-one image restoration framework that utilizes spherical layer-wise expert routing, a spherical uniform degradation embedding with contrastive learning, and a global-local granularity fusion module to effectively overcome feature interference and spatial non-uniform degradations, achieving state-of-the-art performance across multiple restoration tasks.

Peng Shurui, Xin Lin, Shi Luo, Jincen Ou, Dizhe Zhang, Lu Qi, Truong Nguyen, Chao Ren

Published 2026-03-09
📖 5 min read🧠 Deep dive

Imagine you have a broken camera. Sometimes the lens is dirty (blur), sometimes it's raining outside (rain streaks), sometimes it's too dark (low-light), and sometimes the image is grainy (noise).

In the past, fixing these problems was like having a toolbox where you needed a different, specialized tool for every single problem. If you had a blurry photo, you grabbed the "blur fixer." If it was rainy, you grabbed the "rain remover." But what if your photo was blurry and rainy? You'd have to use two tools, or worse, try to guess which one to use. This is slow, inefficient, and often leads to messy results.

Recently, scientists tried to build a "Swiss Army Knife" camera repairman—one single AI model that could handle any problem. But these "all-in-one" models had a flaw: they were like a general practitioner who knows a little bit about everything but isn't great at anything specific. They often got confused, mixing up the instructions for fixing rain with the instructions for fixing blur, resulting in a photo that still looked a bit muddy.

Enter SLER-IR: The "Smart Team of Specialists" with a Magic Compass.

The authors of this paper propose a new system called SLER-IR. Instead of one generalist, they built a team of specialists who work together dynamically. Here is how it works, using simple analogies:

1. The Layer-by-Layer Assembly Line (Spherical Layer-wise Expert Routing)

Imagine a photo restoration process as an assembly line with 8 stations (layers).

  • Old Way: Every station had the same worker trying to do everything.
  • SLER-IR Way: At each of the 8 stations, there are 3 different specialists waiting.
    • Station 1 might need a "Texture Expert."
    • Station 5 might need a "Color Expert."
    • Station 8 might need a "Detail Expert."

The magic is that the system doesn't just pick one specialist for the whole job. It looks at the photo at every single station and asks: "Who is best for this specific step?"

  • If the photo is rainy, Station 1 might pick the "Rain Specialist," but Station 5 might pick the "Blur Specialist" because the rain streaks are also blurring the background.
  • This creates thousands of unique paths through the factory. The system builds a custom repair plan for every single photo on the fly.

2. The Magic Compass (Spherical Uniform Degradation Embedding)

How does the system know which specialist to pick? It needs a way to "read" the damage.

  • The Problem: Imagine trying to navigate a city using a map where the distances are distorted. If "Rain" and "Blur" look too close together on the map, the driver (the AI) gets confused and picks the wrong route. This is what happens with standard AI math (linear spaces).
  • The Solution: The authors put the damage types onto a globe (a hypersphere).
    • Think of the globe like a basketball. They place "Rain," "Blur," "Darkness," and "Noise" as pins stuck into the ball.
    • They use a special math trick to ensure these pins are evenly spaced all over the ball, so no two types are accidentally too close.
    • When a new photo comes in, the system draws a line from the center of the ball to the photo's "damage pin." It then simply looks for the nearest specialist pin on the surface. Because the pins are perfectly spaced, the choice is always clear and accurate. This prevents the AI from getting confused.

3. The Big Picture vs. The Close-Up (Global-Local Granularity Fusion)

Sometimes, a photo has a problem in just one corner (like a raindrop on the left side) but is fine on the right.

  • The Problem: If the AI only looks at the "whole picture" (Global), it might miss the raindrop. If it only looks at tiny "patches" (Local), it might lose the context of the whole scene. Also, during training, the AI only sees small squares, but in real life, it sees the whole image. This causes a "mismatch."
  • The Solution: The system uses a two-sense approach.
    • Sense 1 (The Tourist): Looks at the whole image to understand the general vibe (e.g., "It's a dark forest").
    • Sense 2 (The Detective): Zooms in to find specific clues (e.g., "There is a rain streak here").
    • The system combines these two senses to create a guide map. It tells the specialists exactly where to work and what to fix, ensuring that a rain streak in the corner gets fixed without messing up the clear sky in the center.

The Result

By combining these three ideas, SLER-IR acts like a highly organized, adaptable team of experts who communicate perfectly.

  • They don't get confused by mixed problems (like rain + blur).
  • They know exactly who to call at every step of the process.
  • They look at both the big picture and the tiny details.

In short: While other methods try to force one brain to do everything, SLER-IR builds a smart, flexible team that changes its strategy instantly based on the specific damage, resulting in crystal-clear photos even in the worst conditions. The experiments in the paper show that this new "team" beats all the previous "generalists" in both speed and quality.