A benchmark for joint dialogue satisfaction, emotion recognition, and emotion state transition prediction

This paper addresses the scarcity of Chinese datasets for dynamic emotion tracking and satisfaction prediction by introducing a novel multi-task, multi-label dialogue dataset that jointly supports satisfaction recognition, emotion recognition, and emotional state transition prediction.

Jing Bian, Haoxiang Su, Liting Jiang, Di Wu, Ruiyu Fang, Xiaomeng Huang, Yanbing Li, Shuangyong Song, Hao Huang

Published 2026-03-05
📖 4 min read☕ Coffee break read

Imagine you're running a busy coffee shop. You have a barista (the customer service agent) and a line of customers. Sometimes a customer is happy, sometimes they are frustrated because their latte is cold, and sometimes they are just confused about the menu.

If you only look at the final receipt to see if the customer was happy, you miss the whole story. Did they sigh in relief when the barista fixed the order? Did they get angry when they had to wait too long? This paper is about building a "super-observer" that watches the entire conversation, not just the end result.

Here is a simple breakdown of what the researchers did:

1. The Problem: The "Snapshot" vs. The "Movie"

Most previous studies tried to guess if a customer was happy by looking at a single sentence, like a snapshot photo. But human emotions are more like a movie. They change! A customer might start out calm, get a little annoyed when the answer isn't clear, and then feel relieved when the problem is solved.

The researchers realized that to truly understand if a customer is satisfied, you need to track these emotional "plot twists" throughout the whole conversation. Also, they noticed that while there were many English datasets for this, there was a huge gap in Chinese data that tracked these emotional shifts.

2. The Solution: Building a Massive "Emotion Library"

To fix this, the team created a giant new dataset (a library of conversations) specifically for Chinese customer service calls.

  • The Scale: They didn't just write a few scripts; they simulated 90,000 full conversations. That's like recording every single interaction in a massive call center for months.
  • The Content: These conversations cover real-life scenarios: asking about phone plans, complaining about a broken internet connection, or just asking for help.
  • The Magic Labels: Every time a customer spoke, the researchers tagged it with three things:
    1. What are they feeling right now? (e.g., Angry, Grateful, Anxious, or just "Neutral").
    2. How did their mood change? (e.g., Did they go from "Calm" to "Angry"? Or from "Anxious" to "Relieved"?)
    3. Are they happy with the service? (Satisfied, Dissatisfied, or Neutral).

Think of this dataset as a training manual for AI. Instead of just teaching the AI to say "Hello," they are teaching it to read the room, understand the customer's mood swing, and predict if the customer is going to leave a good review or a bad one.

3. The Experiment: Teaching AI to Read Minds

The researchers took several powerful "Large Language Models" (think of these as very smart, digital brains like the one you are talking to right now) and tested them on this new library.

They asked the AI three questions for every sentence in the conversation:

  • "Is the customer angry or happy right now?"
  • "Did their mood just flip from good to bad?"
  • "Will this customer be happy with the service at the end?"

The Results:

  • The AI models were pretty good at guessing the final satisfaction (like predicting the movie ending).
  • They were okay at spotting the current emotion.
  • The Hard Part: Predicting the change in emotion (the "plot twist") was the hardest. It's like trying to guess exactly when a character in a movie will start crying; it's subtle and tricky.

4. Why This Matters

Why do we care about a database of phone calls?

  • Better Customer Service: If an AI can detect that a customer is getting anxious before they even say "I'm angry," the system can alert a human agent to step in and calm things down.
  • Business Success: Happy customers stay loyal. By understanding the emotional journey, companies can fix problems before the customer quits.
  • Language Bridge: This is the first major step in giving Chinese customer service the same emotional intelligence tools that English systems have had for a while.

In a nutshell: The researchers built a massive, detailed map of how Chinese customers feel during service calls. They used this map to teach AI how to not just hear words, but to understand the emotional journey behind them, helping businesses serve people better.