Enhancing Consistency of Werewolf AI through Dialogue Summarization and Persona Information

This paper presents an LLM-based Werewolf AI agent for the AIWolfDial 2024 shared task that improves utterance consistency and character maintenance by leveraging dialogue summaries and manually designed personas.

Yoshiki Tanaka, Takumasa Kaneko, Hiroki Onozeki, Natsumi Ezure, Ryuichi Uehara, Zhiyang Qi, Tomoya Higuchi, Ryutaro Asahara, Michimasa Inaba

Published Tue, 10 Ma
📖 5 min read🧠 Deep dive

Imagine a high-stakes game of Mafia (or Werewolf) played over a text chat. In this game, some players are "villagers" trying to find the bad guys, while others are "werewolves" and a "possessed" person trying to trick everyone. The catch? You can't see who is who. You have to figure it out just by listening to what people say, watching how they act, and remembering everything that happened yesterday.

The researchers in this paper built a team of AI players to compete in this game. Their goal was to make these AI players act like real humans: smart, consistent, and with distinct personalities.

Here is how they did it, explained with some everyday analogies:

1. The Problem: The "Too Much Information" Overload

Imagine you are playing a game that lasts for three days. By Day 2, you have to remember every single thing everyone said on Day 1, plus the new stuff on Day 2. If you tried to read the entire transcript of a 3-day conversation every time you wanted to speak, your brain would get tired, you'd get confused, and you might forget the most important clues.

The AI's Solution: The "Cheat Sheet" (Dialogue Summarization)
Instead of forcing the AI to read the entire 50-page transcript of the game history every time it speaks, the researchers gave it a condensed cheat sheet.

  • How it works: At the end of every day, the AI uses its own brain (a Large Language Model) to write a short summary of what happened. It highlights who claimed to be the "Seer" (the detective), who was suspicious, and who got voted out.
  • The Benefit: When the game starts on Day 2, the AI doesn't need to re-read the whole book. It just reads the "Cliff's Notes" summary. This keeps the AI fast, saves money on computer power, and helps it remember the important clues without getting lost in the noise.

2. The Problem: The "Chameleon" Effect

Have you ever talked to a friend who suddenly changes their personality based on who they are talking to? One minute they are a shy librarian, and the next they are a loud party animal because they are trying to fit in. In a game like Werewolf, if your AI agent changes its tone or personality every time someone else speaks, the other players will know it's a robot. It breaks the immersion.

The AI's Solution: The "Character Sheet" (Persona Design)
To stop the AI from acting like a chameleon, the researchers gave each AI a strict character profile, like a role-playing game character sheet.

  • The Villager/Seer: They are the "King of a Kingdom." They speak with dignity, pride, and strictness. They don't use slang.
  • The Werewolf: They are a "17-year-old soccer player." They are energetic, loud, use slang, and hate being polite.
  • The Possessed: They are a "shy, stuttering gamer." They talk nervously and act awkward.
  • The Magic: The researchers didn't just tell the AI what to say; they gave it examples of how these characters talk. Every time the AI speaks, it checks its character sheet to make sure it sounds like the "Soccer Player" or the "King," not like a generic robot. This makes the AI feel like a real person with a consistent soul.

3. The Problem: The "Oops, I Contradicted Myself" Moment

In a complex game, it's easy to make a mistake. You might say, "I think Player A is the werewolf," and then five minutes later vote for Player B without explaining why. This looks suspicious and untrustworthy.

The AI's Solution: The "Thinking Aloud" Strategy (Chain-of-Thought)
Before the AI makes a big move (like voting someone out or guessing a role), the researchers made it think out loud first.

  • How it works: The AI is forced to write down its reasoning step-by-step before it types its final answer. It asks itself: "Why do I suspect Player A? What did they say yesterday? Does this match my plan?"
  • The Benefit: This acts like a safety net. It ensures the AI's final vote matches its earlier arguments, making it look much more logical and human.

The Result: A Winning Team

The researchers tested their AI by having them play against each other. The results showed that:

  1. They remembered everything: Even on Day 2, the AI could recall specific details from Day 1 because of the "Cheat Sheet" summaries.
  2. They stayed in character: The "King" sounded royal the whole time, and the "Soccer Player" stayed energetic, even when the game got stressful.
  3. They were consistent: They didn't flip-flop on their votes or contradict their own stories.

In a nutshell: The researchers taught the AI to be a better game player by giving it a summary of the past (so it doesn't forget), a strict personality guide (so it doesn't act weird), and a thinking process (so it doesn't make mistakes). This makes the AI feel less like a computer program and more like a clever, consistent human player.