Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer
Imagine you are trying to figure out how a friend decides what to eat for dinner. You have two ways to learn about their process:
- The "What" (Behavior): You watch them order. They pick the pizza. You see the result.
- The "How" (Think-Aloud): You ask them to talk through their thoughts while deciding. They say, "Hmm, I'm hungry, but pizza is heavy. Maybe I should check the calories first, then compare the cost."
For a long time, scientists trying to build computer models of human thinking have only had access to the "What." They watch people make choices (like picking a risky gamble or a safe one) and try to reverse-engineer the math behind it.
The problem is that the "What" is often a foggy mirror. Many different internal math formulas can produce the exact same final choice. It's like seeing a car drive down a street; you know it moved from A to B, but you don't know if the driver was using a GPS, a map, or just guessing. This makes the computer models "under-determined"—there are too many possible answers, and the computer might pick the wrong one just because it fits the data okay.
The New Approach: Listening to the Inner Monologue
This paper introduces a new way to build these models. Instead of just watching the final choice, the researchers fed the computer models the "How" as well—the actual spoken thoughts (Think-Aloud traces) people had while making decisions.
They used a super-smart AI (a Large Language Model) to act as a detective. The AI was given two types of clues:
- Clue A: The list of choices the person made.
- Clue B: The transcript of what the person said while making those choices.
The AI then tried to write a computer program that could explain both the choices and the spoken thoughts.
What They Found
The researchers tested this on people making risky decisions (like choosing between a sure small reward or a chance at a big reward). Here is what happened when they added the "spoken thoughts" to the mix:
1. The Models Got Smarter (Better Predictions)
When the AI used only the choices, it made decent guesses. But when it used the choices plus the spoken thoughts, the models became much better at predicting what the person would do next time. It's like a detective solving a crime: if you only see the footprints, you might guess the wrong suspect. But if you also hear the suspect's alibi, you can pinpoint the truth much more accurately.
2. The Models Changed Their "DNA" (Structural Shift)
This is the most surprising part. The AI didn't just tweak the numbers; it completely changed the type of logic it used to explain the human mind.
- Without the spoken thoughts: The AI mostly thought humans were using a "Tug-of-War" method. It assumed people calculated the value of Option A, calculated the value of Option B, and then simply compared the two numbers to see which was bigger.
- With the spoken thoughts: The AI realized that for most people (about 70%), the brain works more like a "Smoothie Blender." Instead of just comparing two separate numbers, people were actually mixing the ingredients (risk, reward, probability) inside each option first, blending them into a single feeling, and then making a choice.
The paper found that for nearly 7 out of 10 people, adding the spoken thoughts forced the AI to abandon the "Tug-of-War" model and switch to the "Blender" model.
The Big Takeaway
The main point of this paper is that listening to how people think changes the map we draw of their minds.
If you only look at the destination (the choice), you might draw a map that looks like a straight line. But if you listen to the traveler's commentary, you realize they took a winding path, stopped to look at a view, and maybe even backtracked.
By adding "Think-Aloud" data, the researchers didn't just get a slightly better map; they discovered that the terrain itself was different than they thought. The spoken words acted as a constraint, forcing the computer to stop guessing and start finding the actual mental machinery people were using—machinery that was invisible if you only watched their hands.
Drowning in papers in your field?
Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.