Efficient Learning of Predictive Maps for Flexible Planning

This paper introduces the Successor Representation with Importance Sampling (SR-IS), a novel model that constructs policy-independent predictive maps to enable efficient, flexible planning and better explains human replanning biases compared to existing theories.

Original authors: Bazarjani, A., Piray, P.

Published 2026-06-22
📖 3 min read☕ Coffee break read

Original authors: Bazarjani, A., Piray, P.

Original paper licensed under CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/). ⚕️ This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine your brain is like a highly skilled tour guide trying to navigate a massive, ever-changing city. To get from point A to point B quickly, the guide needs a mental map.

The Old Problem: The "Routine" Map
For a long time, scientists thought the brain used a specific type of map called the "Successor Representation." Think of this like a commuter's subway map. It's incredibly useful if you always take the same route to work every day. It tells you exactly which stations you'll pass through based on your usual habits.

But here's the catch: this map is rigid. If the city suddenly closes a tunnel or you decide you want to go to a different neighborhood, the commuter map is useless. It's so tied to your old routine that it can't easily help you plan a new trip. In the paper's terms, this map is "policy-dependent," meaning it only works if you stick to your current plan.

The New Solution: The "Explorer's" Map (SR-IS)
The authors of this paper introduce a new model called SR-IS (Successor Representation with Importance Sampling). You can think of this as upgrading from a rigid subway map to a dynamic, GPS-style explorer's map.

Instead of just recording where you usually go, this new model uses a clever trick (called "importance sampling") to learn the layout of the entire city, regardless of which path you are currently walking. It's like a guide who learns the streets by observing traffic patterns and potential detours, not just by following their own daily commute.

Why This Matters
Because this new map isn't stuck on one specific route, it has two superpowers:

  1. Instant Adaptation: If the environment changes (like a road closure), the guide can instantly update the map and suggest a new path without having to relearn the whole city from scratch.
  2. Better Human Simulation: The paper shows that this model explains how humans actually think better than the old models. When people are asked to change their plans, they don't just flip a switch; they show "graded biases" (small, logical hesitations or adjustments). The old maps couldn't explain these subtle human quirks, but the new SR-IS model matches human behavior perfectly.

The Bottom Line
This paper bridges the gap between how we think the brain builds mental maps and how we actually behave when we need to be flexible. It suggests that our brains might be using a smart, adaptable system (SR-IS) that allows us to learn the structure of our world once, and then use that knowledge to plan any future journey, no matter how the rules change.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →