Beyond next-word prediction: hierarchical linguistic composition drives LLM-brain alignment in time

By manipulating linguistic features in predictability-matched sentences, this study demonstrates that hierarchical linguistic composition, particularly syntactic structure and associative semantics, significantly drives the alignment between LLM representations and human brain activity, while compositional semantics appears to be more uniquely encoded in the human brain.

Original authors: Zhao, J., Brennan, J. R.

Published 2026-05-16
📖 3 min read☕ Coffee break read

Original authors: Zhao, J., Brennan, J. R.

Original paper licensed under CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/). ⚕️ This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine your brain and a super-smart computer (a Large Language Model, or LLM) are both listening to a story being read aloud. Scientists have noticed that when humans listen, their brain waves "dance" in a similar rhythm to the internal calculations the computer makes. But why? Is it just because both the brain and the computer are good at guessing what word comes next (like finishing a sentence), or is there something deeper about how they understand the structure of language?

This paper treats the brain and the computer like two different chefs trying to recreate the same dish. The researchers wanted to know: Are they just following the same recipe because they both know the ingredients well (statistical patterns), or do they actually understand the cooking process (hierarchical composition) in the same way?

To find out, the team served up sentences to both human volunteers (while recording their brain waves with an EEG cap) and a computer model called GPT-2. They carefully tweaked the sentences in three specific ways, making sure that the "guessability" of the words remained the same so that the only difference was the type of meaning:

  1. The "Grammar Skeleton" (Syntactic Structure): They looked at sentences with clear, organized grammar.

    • The Result: When the sentences had strong grammatical structure, the computer's internal "thoughts" and the human brain waves matched up even better. It's like finding that both the chef and the computer are using the same specific knife skills when chopping vegetables.
  2. The "Building Block Meaning" (Compositional Semantics): This is when the meaning of a phrase is built strictly from the meanings of its parts (like "red car" meaning a car that is red).

    • The Result: Surprisingly, when the sentences relied heavily on this kind of building-block meaning, the match between the computer and the brain dropped. It's as if the human chef started using a secret family technique that the computer simply didn't have. The human brain seems to handle this specific type of meaning in a unique way that the computer doesn't quite replicate.
  3. The "Word Association" (Associative Semantics): This is when words are linked by loose connections or habits (like thinking of "butter" when you hear "bread").

    • The Result: Changing these associations didn't change the match at all. The computer and the brain were already on the exact same page regarding these loose connections. It's like both chefs automatically knowing that "salt" goes with "pepper" without needing a special instruction.

The Big Takeaway
The study shows that the connection between human brains and AI isn't just about predicting the next word. The computer and the brain are actually "speaking the same language" when it comes to grammar and word associations. However, when it comes to building complex meanings from smaller parts, the human brain has a special, unique way of doing it that the computer hasn't quite mastered yet. The computer is a great mimic of our habits and rules, but our brains have a unique flair for constructing meaning that the machine is still learning to copy.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →