Determinants of visual ambiguity resolution

By analyzing a large dataset of ambiguous images and human ratings, this study reveals that subjective identification relies primarily on the preservation of high-level visual features, exhibits a flexible shift from top-down to bottom-up processing upon disambiguation, and follows a non-linear U-shaped relationship with information gain where clarity improves when new information either strongly confirms or contradicts prior predictions.

Original authors: Linde-Domingo, J., Ortiz-Tudela, J., Voeller, J., Hebart, M. N., Gonzalez-Garcia, C.

Published 2026-03-05
📖 5 min read🧠 Deep dive
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you are walking down a foggy road at night. You see a dark, blurry shape in the distance. Is it a person? A mailbox? A stray dog? Your brain is trying to guess, but the information is incomplete. This is visual ambiguity.

This paper is like a massive experiment where researchers handed 1,000 people a giant box of "foggy photos" (called Mooney images—black and white pictures that look like random blobs until you know what they are) and asked them to guess what the objects were. Then, they showed them the clear, normal photo of the same object, and asked them to look at the foggy photo again.

Here is what they discovered, explained through simple analogies:

1. The "High-Level" vs. "Low-Level" Puzzle

Think of your brain's visual system like a two-step detective process:

  • High-Level Features (The Big Picture): These are the general ideas, like "it's an animal" or "it's a vehicle."
  • Low-Level Features (The Details): These are the specific lines, curves, and edges.

The Finding: When the foggy photo is first shown, your brain is desperate for the Big Picture. If the foggy photo has lost the "animal-ness" or "vehicle-ness" (the high-level features), you can't guess it, no matter how hard you try. The low-level details (the squiggly lines) don't matter much yet because you don't have a hypothesis to test.

The Twist: Once you see the clear photo and "get it," your brain changes tactics. Suddenly, those specific squiggly lines (low-level features) become super important. Your brain now uses the clear photo as a template and checks: "Do the lines in this foggy blob match the lines of the dog I just saw?"

Analogy: Imagine trying to identify a friend in a crowd wearing a mask.

  • Before you know who it is: You need to see their face shape or height (High-Level). If the mask hides their whole face, you are stuck.
  • After you know it's your friend: You stop looking at the face shape. Instead, you look for a specific scar on their chin or the way they hold their coffee cup (Low-Level). Now that you have the "answer," the tiny details confirm your guess.

2. The "U-Shaped" Surprise

You might think that the more information you get, the easier it is to guess. If the fog clears a little bit, you should get a little better at guessing, right?

The Finding: Not exactly. The relationship is U-shaped.

  • Scenario A (Total Confusion): You guess wildly wrong. Then you see the clear photo. The information gain is huge. You go from "I have no idea" to "Oh, it's a toaster!" Result: You feel very clear about what it is.
  • Scenario B (Total Clarity): You guessed correctly (or very close) before seeing the clear photo. The clear photo just confirms you. The information gain is tiny. Result: You still feel very clear about what it is.
  • Scenario C (The Middle Ground): You guessed something sort of related but wrong. Then you see the clear photo. It doesn't fully confirm your guess, but it changes your mind. This "middle ground" of information gain actually makes you feel less certain than the other two extremes.

Analogy: Think of it like solving a riddle.

  • If the answer is completely different from what you thought, the "Aha!" moment is huge and satisfying.
  • If the answer is exactly what you thought, the "Aha!" moment is a satisfying nod of confirmation.
  • But if the answer is almost what you thought, but slightly different, it leaves you feeling confused and unsure. "Wait, was it a cat or a raccoon?" That middle ground is the bottom of the "U."

3. The "Group Consensus" Effect

Before seeing the clear photo, if you asked 100 people to name the blob, they would all say different things (high "entropy" or confusion). One says "cloud," another says "shoe," another says "cloud."

After seeing the clear photo, if you ask them again, they all suddenly agree. They all say "It's a shoe." The group becomes much more consistent. The researchers found that this shift from chaos to agreement happens almost instantly once the "key" (the clear photo) is revealed.

The Big Takeaway

Our brains are not just passive cameras that record what we see. They are active guessing machines.

  1. When we are confused: We rely on our brain's "big picture" predictions. If the picture is too blurry to match a big idea, we are stuck.
  2. When we learn the answer: Our brain switches to "detail mode," checking the tiny lines to confirm the match.
  3. The learning curve: We learn best when we are either totally wrong (and get a big correction) or totally right (and get confirmation). Being "sort of right" is the most confusing state of all.

This study helps us understand how our brains turn a blurry, confusing world into a clear, understandable one, and why sometimes getting more information doesn't always make us feel clearer.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →