MERLIN: Building Low-SNR Robust Multimodal LLMs for Electromagnetic Signals

The paper introduces MERLIN, a novel training framework accompanied by the EM-100k dataset and EM-Bench benchmark, to overcome data scarcity, evaluation gaps, and low-SNR fragility in building robust Multimodal Large Language Models for electromagnetic signals.

Junyu Shen, Zhendong She, Chenghanyu Zhang, Yuchuang Sun, Luqing Luo, Dingwei Tan, Zonghao Guo, Bo Guo, Zehua Han, Wupeng Xie, Yaxin Mu, Peng Zhang, Peipei Li, Fengxiang Wang, Yangang Sun, Maosong Sun

Published 2026-03-10
📖 5 min read🧠 Deep dive

Imagine you have a super-smart robot (a Large Language Model, or LLM) that is amazing at reading books, writing stories, and chatting with people. But right now, this robot is completely deaf to the world of radio waves, radar, and Wi-Fi signals. It can't "hear" the electromagnetic (EM) signals that power our modern world.

The paper you shared, "MERLIN," is like a blueprint for teaching this robot to not only hear these invisible signals but to understand them, even when the signal is very weak and full of static (noise).

Here is the story of how they did it, broken down into three simple parts:

1. The Problem: The Robot is Deaf and the Signal is Faint

Currently, trying to make these robots understand radio signals is like trying to teach someone to read a book written in a language they don't know, while they are standing in a hurricane.

  • No Dictionary: There are almost no books (datasets) that pair radio signals with human explanations. The robot has nothing to learn from.
  • No Test: There is no standardized exam (benchmark) to see if the robot is actually getting smarter or just guessing.
  • The Static: When the signal is weak (low "Signal-to-Noise Ratio"), it's like trying to hear a whisper in a rock concert. The robot gets confused and fails completely.

2. The Solution: Building the Library, the Exam, and the Training Camp

The team created three things to fix this:

A. The Library: EM-100k

They realized the robot needed a massive library to study. So, they built EM-100k, a dataset containing 100,000 pairs of radio signals and their descriptions.

  • Analogy: Imagine they took 35 million raw radio "snippets" (like recording every possible sound a car engine makes) and hired experts to write down exactly what each sound means (e.g., "This is a radar pulse," "This is a jamming signal"). Now the robot has a massive textbook to study.

B. The Exam: EM-Bench

To know if the robot is actually learning, they built EM-Bench, a rigorous test.

  • Analogy: Instead of just asking, "Do you know what a radio is?", they give the robot a multi-level exam.
    • Perception: "What kind of modulation is this?" (Like identifying the instrument playing a note).
    • Reasoning: "Someone is jamming this signal; what strategy should we use to fight back?" (Like a chess player figuring out a counter-move).
    • The exam covers everything from spotting simple signals to planning complex electronic warfare strategies.

C. The Training Camp: MERLIN

This is the most important part. They didn't just feed the robot more data; they invented a new way to train it called MERLIN.

The Two-Stage Training:

  1. Stage 1 (The Basics): They teach the robot to match signals with text using the new library (EM-100k). It learns the "vocabulary" of radio waves.
  2. Stage 2 (The Noise Challenge): This is the magic trick.
    • The Problem: When the signal is noisy (static), the robot's brain gets foggy. It can't tell the difference between a real signal and the noise.
    • The Fix: They use a technique called Knowledge Distillation.
    • The Analogy: Imagine a Teacher and a Student.
      • The Teacher is the robot looking at a perfect, crystal-clear signal.
      • The Student is the robot looking at the same signal but covered in static and noise.
      • The Teacher says, "Even though you see noise, I see a clear pattern. Here is what the pattern should look like."
      • The Student tries to mimic the Teacher's "clean" understanding, ignoring the noise.
    • The Secret Sauce: They added a special filter (called the Denoising Subspace Module) that acts like noise-canceling headphones for the robot's brain. It strips away the static before the robot tries to understand the signal, forcing it to learn the true shape of the message.

3. The Result: A Super-Listener

After this training, the MERLIN robot became a master of the electromagnetic world.

  • It passed the exam: It scored higher than any other model (even huge, expensive ones from big tech companies) on the EM-Bench test.
  • It's tough: Even when the signal is very weak and noisy (like trying to hear a whisper in a storm), MERLIN didn't crash. It kept working because it learned to ignore the static and focus on the core message.

Summary

In short, the paper says: "We built a giant library of radio signals, created a hard test to measure progress, and invented a special training method where a 'clean' teacher guides a 'noisy' student. This allows our AI to finally understand the invisible language of the electromagnetic spectrum, even when the connection is terrible."

This is a huge step forward for things like radar, secure communications, and autonomous vehicles that need to "hear" the world around them without getting confused by interference.