RetinaVision: XAI-Driven Augmented Regulation for Precise Retinal Disease Classification using deep learning framework

This study presents RetinaVision, an XAI-driven deep learning framework utilizing Xception and InceptionV3 architectures with advanced data augmentation to achieve high-accuracy retinal disease classification from OCT images while ensuring clinical interpretability through GradCAM and LIME.

Mohammad Tahmid Noor, Shayan Abrar, Jannatul Adan Mahi, Md Parvez Mia, Asaduzzaman Hridoy, Samanta Ghosh

Published 2026-02-24
📖 4 min read☕ Coffee break read

Imagine your eye is like a high-end camera, and the retina is the film (or sensor) at the back that captures the picture. If that film gets damaged or develops "scratches" (diseases), your vision fades. The problem is, doctors have to look at these tiny, blurry scratches on a screen one by one. It's like trying to find a specific typo in a library of a million books by reading every single page manually. It's slow, tiring, and sometimes, a tired doctor might miss a tiny clue.

This paper introduces RetinaVision, a new "super-assistant" for doctors. It's an artificial intelligence (AI) system designed to look at eye scans and instantly tell the doctor exactly what's wrong, acting like a tireless, hyper-observant intern who never blinks.

Here is the breakdown of how they built this assistant, using simple analogies:

1. The Training Camp (The Data)

To teach this AI, the researchers didn't just show it a few pictures. They gave it a massive library of 24,000 eye scans (called OCT images). Think of these scans as "X-rays" of the eye's layers. The AI had to learn to spot 8 different types of eye diseases, from common ones like "Drusen" (dust-like deposits) to serious ones like "Diabetic Macular Edema" (swelling).

2. The Two Student Athletes (The AI Models)

The researchers didn't just pick one brain; they trained two different "super-brains" (AI architectures) to see who was better:

  • Xception: Think of this as a sprinter who is incredibly fast and efficient at spotting details. It uses a special technique to ignore unnecessary noise and focus only on the important parts of the image.
  • InceptionV3: This is like a detective who looks at a problem from many different angles at once, piecing together small clues to form a big picture.

3. The "Mix-and-Match" Gym (Data Augmentation)

To make sure these AI students didn't just memorize the answers (cheating), the researchers used a trick called CutMix and MixUp.

  • The Analogy: Imagine you are teaching a child to recognize a dog. If you only show them a Golden Retriever, they might think all dogs are Golden Retrievers.
  • The Trick: The researchers took a picture of a "sick eye," cut out a piece of it, and pasted it onto a "healthy eye." Then they told the AI, "This is a mix of both." This forced the AI to learn the actual features of the disease, not just the background pattern. It's like training a chef by giving them random ingredients and asking them to make a specific dish, ensuring they truly understand the flavors, not just the recipe.

4. The "Black Box" Problem (Explainable AI)

Usually, AI is a "black box." You give it an input, and it gives an answer, but you don't know why. Doctors can't trust a machine if they don't understand its logic.

  • The Solution: The researchers added Grad-CAM and LIME.
  • The Analogy: Imagine the AI is a student taking a test. Instead of just writing the answer "A," the AI now highlights the specific words in the question that led it to that answer.
  • The Result: When RetinaVision says, "This eye has a disease," it also draws a glowing red heatmap over the exact spot on the scan where the disease is. It's like a teacher saying, "I gave you an A because you circled the correct evidence here." This builds trust with the doctor.

5. The Results: Who Won?

After a grueling training session (50 rounds of practice), the results were in:

  • Xception was the champion, getting 95.25% of the diagnoses correct.
  • InceptionV3 came in a very close second with 94.82%.

Both were significantly better than previous methods, which were like "good students" scoring around 80-90%. Xception was the "valedictorian."

6. The Real-World App (RetinaVision)

The researchers didn't just leave this in a lab. They built a website app (RetinaVision).

  • How it works: A doctor (or even a patient in a clinic) can upload an eye scan. Within seconds, the app analyzes it, tells them what disease it is, gives a confidence score (e.g., "99% sure"), and shows the heatmap of where the problem is.

The Bottom Line

This paper is about building a digital safety net for our eyes. By using smart AI that can "see" diseases faster than a human and explain why it sees them, we can catch eye problems earlier. Early detection is the difference between losing your sight and keeping it clear.

In short: They taught two super-smart computers to read eye scans, taught them to learn from mixed-up examples so they don't get confused, made them show their work so doctors trust them, and turned it all into a website that could save eyesight.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →