The Periaqueductal Gray Selectively Supports Reversal Learning During a Flexible Discrimination Task in Mice

Using whole-brain fMRI in behaving mice combined with reinforcement learning modeling, this study reveals that the periaqueductal gray (PAG) is selectively recruited during reversal learning to suppress previously rewarded actions and update behavior, highlighting a novel role for this midbrain structure in cognitive flexibility beyond its traditional association with threat processing.

Lichtman, D., Bergmann, E., Nicholas, J., Gerraty, R. T., Kahn, I.

Published 2026-04-01
📖 4 min read☕ Coffee break read
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine your brain is a highly sophisticated navigation system, like a GPS in a car. Usually, this GPS works great: it tells you that "Turn Left at the big oak tree" leads to a delicious pizza (a reward). You learn this rule, and you drive there happily every day.

But what happens when the pizza place closes down and a new one opens up on the right? Your old GPS is now stuck on autopilot, trying to take you to the closed shop. To get the new pizza, you have to unlearn the old rule and learn a new one. This ability to change your mind when the rules of the game change is called cognitive flexibility.

For a long time, scientists thought this "re-routing" was handled mostly by the brain's "reward center" (the part that gets excited about pizza). But this new study, using a high-tech MRI scanner on mice, discovered a surprise player in this process: a tiny, ancient part of the brain called the Periaqueductal Gray (PAG).

Here is the story of what they found, broken down simply:

1. The Experiment: The "Sniff and Lick" Game

The researchers set up a game for mice inside an MRI machine.

  • The Setup: Mice were trained to sit still while smelling two different scents.
  • The Rule (Acquisition): Scent A meant "Lick the tube to get a water drop!" (Reward). Scent B meant "Don't lick, or you get nothing."
  • The Twist (Reversal): Once the mice mastered the game, the researchers flipped the script. Suddenly, Scent A meant "Don't lick," and Scent B meant "Lick for water!"

The mice had to figure out that the old rules were broken and learn the new ones quickly.

2. The Old Theory vs. The New Discovery

Scientists already knew that the Nucleus Accumbens (let's call it the "Happy Button") lights up when a mouse does the right thing and gets a reward. It's like a cheerleader shouting, "Yes! Do that again!"

However, when the rules changed, the researchers noticed something strange happening in the PAG.

  • The PAG's Job: Think of the PAG as the brain's "Brake Pedal" or "Stop Sign."
  • What it did: When the mouse successfully stopped itself from licking at the wrong scent (the "No-Go" signal), the PAG lit up like a Christmas tree.
  • The Twist: The PAG didn't care about the reward; it cared about inhibition. It was the part of the brain saying, "Wait! Don't do that old thing anymore! Stop!"

3. The "Double Dissociation" (The Perfect Team)

The study found a beautiful teamwork dynamic between the two brain regions during the rule change:

  • The Happy Button (Nucleus Accumbens): Shouted "GO!" when the mouse licked at the new correct scent.
  • The Brake Pedal (PAG): Shouted "STOP!" when the mouse resisted licking at the old wrong scent.

It's like driving a car: You need the gas pedal (Happy Button) to move forward toward the new goal, but you desperately need the brakes (PAG) to stop yourself from crashing into the old, familiar path.

4. Why This Matters

Here is the most surprising part: The PAG only showed up when the rules changed. When the mice were just learning the game for the first time, the PAG was quiet. It only woke up when the mice had to unlearn something they already knew.

This suggests that the PAG isn't just for dealing with pain or fear (which is what we used to think it did). Instead, it's a crucial part of our ability to be flexible. It helps us suppress our habits when the world changes, even if no one is punishing us for being wrong—we just have to realize the old way doesn't work anymore.

The Big Picture

This study is like finding a new, essential gear in a watch that everyone thought was already perfect. It shows that our ability to adapt isn't just about chasing rewards; it's equally about having a specialized "brake system" in our brain that helps us let go of the past.

By using MRI to watch the whole mouse brain at once, the researchers found that cognitive flexibility is a team sport involving both the "Go" signals and the "Stop" signals working in perfect harmony. Without that "Stop" signal from the PAG, we would all be stuck driving in circles, trying to find a pizza place that closed years ago.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →