Why Conclusions Diverge from the Same Observations: Formalizing World-Model Non-Identifiability via an Inference

This paper formalizes the phenomenon of divergent conclusions from identical observations as a structural issue of world-model non-identifiability arising from differing inference profiles and learning biases, rather than as a defect of rationality or good faith.

Original authors: Toru Takahashi

Published 2026-05-13✓ Author reviewed
📖 6 min read🧠 Deep dive

Original authors: Toru Takahashi

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine two people sitting at a table, looking at the exact same pile of documents, charts, and news reports. Yet, one person concludes, "We need to stop this project immediately," while the other says, "We should double down and move faster."

In the real world, we often react to this by saying, "One of them is crazy," "They are lying," or "They just don't get it." We assume the problem is a character flaw.

This paper argues that we are looking at the wrong thing. It suggests that the disagreement isn't about who is looking, but how they are looking. The author, Toru Takahashi, proposes that when people share the same facts but reach different conclusions, it's not a defect in their brains—it's a mathematical inevitability called non-identifiability.

Here is the paper's argument broken down into simple concepts and analogies.

1. The Core Idea: The "Same Input, Different Output" Problem

The paper starts by rejecting the idea that there is only one "correct" way to think (which it calls the Single Intelligence Assumption). Instead, it suggests that thinking is like a machine with many dials. Even if two people feed the exact same data into their brains, if they turn the dials differently, they will get different answers.

The author splits this into two levels of "glitches":

  • Level 1: The Settings Glitch (θ\theta-level). Imagine two chefs using the exact same recipe and the exact same ingredients. One chef decides to add a pinch of salt, cook it for 5 minutes, and taste it immediately. The other chef adds no salt, cooks it for 20 minutes, and tastes it slowly. They end up with different dishes, not because the ingredients were bad, but because their settings were different.
  • Level 2: The Memory Glitch (WW-level). Now, imagine those chefs keep cooking every day. The first chef only ever cooks dishes that are salty and fast. The second only cooks slow, bland dishes. Over time, their memory of what "good food" is changes. They have built different internal models of the world. Now, even if you give them the same new ingredient, they will interpret it differently because their past experiences have shaped their brains to expect different things.

2. The Four Dials of Thinking

To explain why people think differently, the author introduces a "Thinking Profile" with four adjustable dials. Think of these as the settings on a camera or a video game:

  1. Reference (R): What do you trust?
    • Do you trust hard numbers, logs, and legal text (things you can show a friend and say, "Look, it's right here")? Or do you trust gut feelings, unspoken risks, and intuition (things that are hard to explain)?
    • Analogy: One person drives by looking strictly at the speedometer and GPS. The other drives by looking at the road, the wind, and a "feeling" that something is wrong.
  2. Exploration (E): How many possibilities do you keep open?
    • Do you quickly decide on one answer and stick to it? Or do you keep many "what if" scenarios running in your head at once?
    • Analogy: A detective who immediately arrests the first suspect vs. a detective who keeps a list of ten suspects and investigates all of them.
  3. Stabilization (S): How hard is it to change your mind?
    • When new info arrives, do you instantly update your plan? Or do you stick to your original rule unless the new info is overwhelming?
    • Analogy: A thermostat that changes the temperature the second the room feels a degree warmer vs. one that waits until the room is freezing before turning on the heat.
  4. Horizon (D): How far into the future do you look?
    • Do you care about what happens next week? Or next decade?
    • Analogy: A farmer who plants crops for next month's market vs. one who plants trees that won't bear fruit for 20 years.

3. Why Do We Argue About the Same Three Things?

You might think there are infinite ways to disagree. But the paper argues that because our brains have limits (we can't process infinite data, we can't see everything, and we have to talk to each other), these four dials tend to collapse into just three main arguments:

  1. Abstract vs. Concrete:
    • The Conflict: One person wants to talk about big, general principles (Abstract). The other wants to talk about specific, messy details (Concrete).
    • The Cause: Our brains have to compress information to fit it in. Sometimes we compress too much (losing details), and sometimes we hold onto too much detail (losing the big picture).
  2. External vs. Internal:
    • The Conflict: One person says, "Show me the data!" (External). The other says, "You just don't understand the risk I feel!" (Internal).
    • The Cause: It's hard to share your internal feelings. It's easy to share a spreadsheet. People argue over whether the "feelings" count as valid evidence.
  3. Order vs. Freedom:
    • The Conflict: One person wants strict rules and consistency (Order). The other wants flexibility and new ideas (Freedom).
    • The Cause: We have to balance stability (not changing our minds every second) with adaptability (changing our minds when we learn something new).

4. A Real-World Example: AI Regulation

The paper uses the debate over regulating Artificial Intelligence to show how this works.

  • The Shared Facts: Everyone sees the same reports on AI accidents, economic growth stats, and technical benchmarks.
  • The "Precautionary" Group:
    • Reference: They focus on hard-to-externalize fears (e.g., "What if we lose control?").
    • Exploration: They keep "worst-case scenarios" alive in their minds.
    • Stabilization: They want strict, unchangeable rules.
    • Horizon: They look 50 years into the future.
    • Conclusion: "Ban it or regulate it heavily."
  • The "Promotion" Group:
    • Reference: They focus on externalizable data (e.g., "Look at these economic numbers").
    • Exploration: They focus on the most likely, positive scenarios.
    • Stabilization: They want flexible rules that can change as tech evolves.
    • Horizon: They look at the next 2–5 years.
    • Conclusion: "Let it grow; we can fix problems later."

The paper says: Neither side is "crazy." They are just using different settings on their thinking machine.

5. The Solution: Stop Blaming, Start Tuning

The paper's main takeaway is that we should stop calling people "irrational" or "bad faith" when they disagree. Instead, we should treat disagreement like a technical problem.

If two people disagree, we shouldn't ask, "Who is stupid?" We should ask:

  • "Are you looking at different parts of the data?" (Reference)
  • "Are you holding onto different possibilities?" (Exploration)
  • "Are you looking at different timeframes?" (Horizon)

By identifying which "dial" is turned differently, we can design better ways to talk. We can agree to look at the same timeframe, or agree to share the same "gut feelings" as data. This turns a moral fight into a solvable engineering problem.

In short: Disagreement isn't a sign of a broken brain; it's a sign of different settings on the same machine. If we understand the settings, we can fix the disagreement.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →