Enhancing Consistency of Werewolf AI through Dialogue Summarization and Persona Information

Imagine a high-stakes game of Mafia (or Werewolf) played over a text chat. In this game, some players are "villagers" trying to find the bad guys, while others are "werewolves" and a "possessed" person trying to trick everyone. The catch? You can't see who is who. You have to figure it out just by listening to what people say, watching how they act, and remembering everything that happened yesterday.

The researchers in this paper built a team of AI players to compete in this game. Their goal was to make these AI players act like real humans: smart, consistent, and with distinct personalities.

Here is how they did it, explained with some everyday analogies:

1. The Problem: The "Too Much Information" Overload

Imagine you are playing a game that lasts for three days. By Day 2, you have to remember every single thing everyone said on Day 1, plus the new stuff on Day 2. If you tried to read the entire transcript of a 3-day conversation every time you wanted to speak, your brain would get tired, you'd get confused, and you might forget the most important clues.

The AI's Solution: The "Cheat Sheet" (Dialogue Summarization)
Instead of forcing the AI to read the entire 50-page transcript of the game history every time it speaks, the researchers gave it a condensed cheat sheet.

How it works: At the end of every day, the AI uses its own brain (a Large Language Model) to write a short summary of what happened. It highlights who claimed to be the "Seer" (the detective), who was suspicious, and who got voted out.
The Benefit: When the game starts on Day 2, the AI doesn't need to re-read the whole book. It just reads the "Cliff's Notes" summary. This keeps the AI fast, saves money on computer power, and helps it remember the important clues without getting lost in the noise.

2. The Problem: The "Chameleon" Effect

Have you ever talked to a friend who suddenly changes their personality based on who they are talking to? One minute they are a shy librarian, and the next they are a loud party animal because they are trying to fit in. In a game like Werewolf, if your AI agent changes its tone or personality every time someone else speaks, the other players will know it's a robot. It breaks the immersion.

The AI's Solution: The "Character Sheet" (Persona Design)
To stop the AI from acting like a chameleon, the researchers gave each AI a strict character profile, like a role-playing game character sheet.

The Villager/Seer: They are the "King of a Kingdom." They speak with dignity, pride, and strictness. They don't use slang.
The Werewolf: They are a "17-year-old soccer player." They are energetic, loud, use slang, and hate being polite.
The Possessed: They are a "shy, stuttering gamer." They talk nervously and act awkward.
The Magic: The researchers didn't just tell the AI what to say; they gave it examples of how these characters talk. Every time the AI speaks, it checks its character sheet to make sure it sounds like the "Soccer Player" or the "King," not like a generic robot. This makes the AI feel like a real person with a consistent soul.

3. The Problem: The "Oops, I Contradicted Myself" Moment

In a complex game, it's easy to make a mistake. You might say, "I think Player A is the werewolf," and then five minutes later vote for Player B without explaining why. This looks suspicious and untrustworthy.

The AI's Solution: The "Thinking Aloud" Strategy (Chain-of-Thought)
Before the AI makes a big move (like voting someone out or guessing a role), the researchers made it think out loud first.

How it works: The AI is forced to write down its reasoning step-by-step before it types its final answer. It asks itself: "Why do I suspect Player A? What did they say yesterday? Does this match my plan?"
The Benefit: This acts like a safety net. It ensures the AI's final vote matches its earlier arguments, making it look much more logical and human.

The Result: A Winning Team

The researchers tested their AI by having them play against each other. The results showed that:

They remembered everything: Even on Day 2, the AI could recall specific details from Day 1 because of the "Cheat Sheet" summaries.
They stayed in character: The "King" sounded royal the whole time, and the "Soccer Player" stayed energetic, even when the game got stressful.
They were consistent: They didn't flip-flop on their votes or contradict their own stories.

In a nutshell: The researchers taught the AI to be a better game player by giving it a summary of the past (so it doesn't forget), a strict personality guide (so it doesn't act weird), and a thinking process (so it doesn't make mistakes). This makes the AI feel less like a computer program and more like a clever, consistent human player.

Enhancing Consistency of Werewolf AI through Dialogue Summarization and Persona Information

1. The Problem: The "Too Much Information" Overload

2. The Problem: The "Chameleon" Effect

3. The Problem: The "Oops, I Contradicted Myself" Moment

The Result: A Winning Team

1. Problem Statement

2. Methodology

A. Dialogue Summarization for Context Management

B. Persona-Driven Utterance Generation

C. Chain-of-Thought (CoT) Reasoning for Action Decisions

D. Prompt Architecture

3. Key Contributions

4. Results

5. Significance

Enhancing Consistency of Werewolf AI through Dialogue Summarization and Persona Information

1. The Problem: The "Too Much Information" Overload

2. The Problem: The "Chameleon" Effect

3. The Problem: The "Oops, I Contradicted Myself" Moment

The Result: A Winning Team

1. Problem Statement

2. Methodology

A. Dialogue Summarization for Context Management

B. Persona-Driven Utterance Generation

C. Chain-of-Thought (CoT) Reasoning for Action Decisions

D. Prompt Architecture

3. Key Contributions

4. Results

5. Significance

More like this

One Language, Two Scripts: Probing Script-Invariance in LLM Concept Representations

MultiGraSCCo: A Multilingual Anonymization Benchmark with Annotations of Personal Identifiers

ConFu: Contemplate the Future for Better Speculative Sampling

SciTaRC: Benchmarking QA on Scientific Tabular Data that Requires Language Reasoning and Complex Computation

Automated Thematic Analysis for Clinical Qualitative Data: Iterative Codebook Refinement with Full Provenance