FinSight-Net:A Physics-Aware Decoupled Network with Frequency-Domain Compensation for Underwater Fish Detection in Smart Aquaculture

FinSight-Net is an efficient, physics-aware underwater fish detection framework that leverages a Multi-Scale Decoupled Dual-Stream Processing bottleneck and an Efficient Path Aggregation FPN to counteract turbidity-induced degradation, achieving state-of-the-art accuracy with reduced computational cost for smart aquaculture monitoring.

Jinsong Yang, Zeyuan Hu, Yichen Li, Hong Yu

Published 2026-02-24
📖 4 min read☕ Coffee break read

Imagine you are trying to take a clear photo of a school of fish swimming in a murky, cloudy pond. It's a nightmare for a camera. The water acts like a dirty window: it blocks certain colors (making everything look blue-green), it scatters light like fog (creating a "veil" over the image), and it blurs the edges of the fish.

Most computer programs trying to "see" these fish today are like a student trying to solve a math problem by brute force. They throw more and more complex tools at the problem, hoping that if they just look harder, they'll figure it out. But this makes the software slow, heavy, and expensive to run on small devices like underwater cameras or drones.

FinSight-Net is a new, smarter approach. Instead of just "trying harder," it understands why the water is messing up the picture and fixes the specific problems one by one. Think of it as a specialized pair of glasses designed just for underwater vision.

Here is how it works, broken down into simple analogies:

1. The "Specialized Team" (MS-DDSP)

Imagine you have a team of four detectives trying to solve a crime in a foggy room.

  • The Old Way: You put all four detectives in a single room and tell them to "look at everything." They get confused, and the fog (noise) hides the clues.
  • The FinSight-Net Way: You split them into four specialized teams, each with a different job:
    • Team 1 (The Big Picture): Looks for the general shape of the fish, ignoring small details that might be blurry.
    • Team 2 (The Fog Remover): Specifically trained to ignore the "fog" (backscattering noise) caused by floating dirt in the water.
    • Team 3 (The Color Fixer): Knows that red light disappears quickly underwater, so it works extra hard to restore the missing red colors so the fish looks natural.
    • Team 4 (The Detail Keeper): Makes sure the tiny, important bits (like fins or scales) don't get lost in the process.

At the end, a smart manager (an attention mechanism) listens to all four teams and says, "Okay, the fog is thick today, so I'll listen more to Team 2. The colors are off, so I'll trust Team 3 more." This creates a perfect, clear picture of the fish.

2. The "Express Elevator" (EPA-FPN)

In deep learning, information usually travels down a long, winding staircase. As it goes down, it gets heavier and loses details (like a message passed down a line of people that gets garbled). By the time it reaches the bottom, the fine details of the fish are gone.

  • The Old Way: The message has to go down every single step, getting slower and losing clarity.
  • The FinSight-Net Way: They build an Express Elevator. This allows the "high-definition" details from the top of the building (the raw image) to zip straight down to the bottom, bypassing the slow, noisy staircase. This ensures that even the tiniest edge of a fish fin is preserved, making it much easier to pinpoint exactly where the fish is.

3. Why It Matters

  • It's Lighter: Because it's so smart about how it works, it doesn't need a supercomputer to run. It can run on small, battery-powered devices attached to fish farms.
  • It's Faster: It processes images in real-time, which is crucial for things like automatic feeding systems. If a fish is hungry, the system needs to know now, not five minutes later.
  • It's More Accurate: In tests, it found fish in very cloudy water much better than the current best systems (beating the latest YOLO models by a significant margin).

The Bottom Line

FinSight-Net is like giving a robot fisherman a pair of smart glasses that know exactly how water distorts light. Instead of guessing, it actively cleans up the fog, restores the missing colors, and keeps the fine details sharp. This allows farmers to monitor their fish health and feeding automatically, even in the murkiest ponds, without needing expensive, heavy equipment.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →