Dutch Metaphor Extraction from Cancer Patients' Interviews and Forum Data using LLMs and Human in the Loop

This paper presents a human-in-the-loop approach using large language models with various prompting strategies to extract Dutch metaphors from cancer patient interviews and forum data, resulting in the creation of the HealthQuote.NL corpus to enhance healthcare communication and personalized care.

Lifeng Han, David Lindevelt, Sander Puts, Erik van Mulligen, Suzan Verberne

Published 2026-03-05
📖 4 min read☕ Coffee break read

Imagine you are trying to understand a patient's experience with cancer. Sometimes, the medical terms are too cold or technical. Instead, patients often speak in pictures. They might say their body is a "battlefield," their treatment is a "long journey," or their tumor is a "weed" that won't die. These pictures are called metaphors.

This paper is about a team of researchers who wanted to collect these pictures from Dutch cancer patients to help doctors and families understand them better. But there was a problem: there were thousands of stories, interviews, and forum posts, and reading them all by hand would take forever.

So, the researchers decided to hire a digital detective (an Artificial Intelligence, or LLM) to do the heavy lifting. Here is how they did it, explained simply:

1. The Mission: Finding Hidden Gems

The researchers had two big piles of "gold dust" (data):

  • Pile A: Transcripts of real-life interviews where patients talked about their feelings.
  • Pile B: Thousands of blog posts and comments from a Dutch cancer website (like a digital support group).

Their goal was to find every metaphor hidden inside these texts. But just like a child looking for a needle in a haystack, the AI had a hard time. At first, the AI was a bit of a daydreamer.

2. The Problem: The AI's "Imagination"

When the researchers first asked the AI, "Find the metaphors," the AI got a bit too creative. It made three main mistakes:

  • The Hallucinator: Sometimes, the AI invented metaphors that weren't actually there. It was like a student making up a quote for a history essay because they didn't want to admit they forgot the facts.
  • The Idiom Confuser: The AI couldn't tell the difference between a real metaphor and a common phrase. If a Dutch person said, "It's raining cats and dogs," the AI might think that's a deep metaphor about animals, when it's just a standard saying.
  • The Summarizer: Instead of pulling out the exact words the patient used, the AI would rewrite the story. It was like asking someone to copy a painting, but they just wrote a description of the painting instead.

3. The Solution: The "Human-in-the-Loop" Team

To fix the AI's daydreaming, the researchers built a training camp for the AI. They didn't just let the AI guess; they gave it a strict rulebook and a human supervisor.

  • The Rulebook (Prompting): They taught the AI how to think step-by-step (like a detective solving a crime). They said, "Don't guess. Look at the exact sentence. Is it a real picture? Is it a common phrase? Prove it."
  • The Supervisor (Human Experts): After the AI found a list of metaphors, three human experts (linguists who speak Dutch) acted as the "quality control inspectors." They checked every single one. If the AI was wrong, they crossed it out.

4. The Result: The "HealthQuote.NL" Treasure Chest

After all this training and checking, the team created a special collection called HealthQuote.NL.

  • They found 130 validated metaphors.
  • These weren't just random words; they were organized like a library. Some were about Journeys (life as a road), some about Battles (fighting the disease), and some about Nature (tumor as a weed).
  • They even found some very creative ones, like a patient comparing their body to a damaged car that needs new bodywork, or cancer as an uninvited party that won't leave.

5. Why Does This Matter?

Think of this collection as a dictionary of feelings.

  • For Doctors: If a doctor knows that a patient sees their treatment as a "journey," they can use that language to explain the next step, making the patient feel understood rather than confused.
  • For Researchers: It helps us understand how people in the Netherlands process the trauma of cancer, which might be different from people in the UK or the US.
  • For the Future: It proves that we can use AI to listen to patients, as long as we have humans to double-check the work.

The Bottom Line

The researchers built a bridge between human emotion and machine speed. They taught an AI to listen to the poetry of cancer patients, fixed its mistakes with human eyes, and created a tool that helps healthcare become more personal, empathetic, and clear. It's not just about data; it's about helping people feel heard.