Here is an explanation of the paper, broken down into simple concepts with some creative analogies.
The Big Question: Are AI Chatbots Just Parrots, or Scientific Models of Language?
Imagine you have a super-smart parrot. This parrot has read every book, tweet, and article on the internet. If you ask it a question, it answers perfectly. It knows grammar, facts, and how to tell a joke.
Now, a famous linguist (like Noam Chomsky) might say: "That's just a parrot! It's mimicking sounds. It doesn't actually understand the rules of language or how our brains work. It's just a fancy tape recorder."
But the author of this paper, Jumbly Grindrod, says: "Wait a minute! We are looking at this the wrong way. We shouldn't be asking whether the parrot works like a human brain. We should be asking whether the parrot is a good map of human language as a social thing."
Here is the breakdown of the argument:
1. Two Different Kinds of "Language"
To understand the paper, we need to split "language" into two buckets:
- Bucket A: The Internal Brain (I-Language). This is the secret, biological software inside your head. It's the invisible rules that make you know that "The cat sat on the mat" is good, but "Mat the on sat cat the" is nonsense. Chomsky thinks this is the only thing linguists should study.
- Bucket B: The Public Social Thing (E-Language). This is language as it exists out in the world. It's the messy, shared agreement between millions of people. It's the slang, the dialects, the grammar rules we all agree to use so we can talk to each other. It's like a giant, invisible web connecting everyone who speaks English.
The Paper's Argument:
Most critics think AI (Large Language Models, or LLMs) fails as a theory of language because it doesn't mimic Bucket A (the human brain). The author says, "Who cares? Let's stop trying to make AI a psychologist and start using it as a sociologist."
The author argues that AI is actually a fantastic tool for studying Bucket B (the public language).
2. The "Map vs. Territory" Analogy
The author says AI shouldn't be thought of as a "Theory" (a set of rules written in a textbook). Instead, it should be thought of as a Scientific Model.
- The Theory: A textbook that says, "Birds fly because of lift and drag."
- The Model: A wind tunnel with a plastic bird in it. The plastic bird can't fly on its own and can't explain anything, but by watching how air flows over its wings, you can learn a lot about how real birds fly.
The Analogy:
Think of an LLM as a giant, digital simulation of a city.
- It's not a real city (it's not made of brick and mortar).
- It's not the people living there (it's not a human brain).
- But it is a detailed simulation of how traffic flows, where people shop, and how the streets connect.
If you want to study how traffic jams happen in a real city, you don't need to be a traffic cop inside every car. You can build a computer simulation (a model) that mimics the traffic patterns. Even if the simulation isn't "real," it teaches you valuable things about the real world.
The author says: LLMs are the traffic simulation for language. They aren't trying to be the human brain; they are trying to be a proxy for the English language itself.
3. The "Black Box" Problem (and How We Solve It)
The Objection:
Critics say, "But these AI models are 'Black Boxes.' We put text in, and text comes out, but we have no idea what's happening inside the machine. It's like a magic trick. If we can't see the gears, how can it be a scientific model?"
The Author's Reply:
Imagine you are trying to understand how a giant, complex clock works. You can't see the gears because they are covered by a metal case.
- Old way: You just guess how it works.
- New way (The Author's way): You use a special X-ray camera (called Explainable AI or XAI).
The paper points out that scientists are now developing "X-ray cameras" for AI (techniques such as probing). They can peek inside and see that when the AI processes a sentence, certain parts of its "brain" light up when it is handling grammar, and other parts light up when it is handling meaning.
Even though we can't control every single gear, we can see enough to say, "Okay, this part of the model represents the rule of 'Subject-Verb Agreement'." This allows us to treat the AI as a valid scientific model.
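To make the "X-ray camera" less abstract, here is a minimal sketch of one such technique, a probing classifier: a simple model trained to read a grammatical property straight off the AI's internal states. Everything specific here is an illustrative assumption, not something from the paper: GPT-2 via the Hugging Face transformers library, an arbitrary middle layer, and a six-sentence toy dataset.

```python
# A minimal probing sketch: train a linear classifier to predict a
# grammatical property (singular vs. plural subject) from a language
# model's hidden states. If a cheap probe can read the property off a
# layer, that layer plausibly encodes it.
# Assumptions (illustrative, not from the paper): GPT-2, layer 6,
# a tiny toy dataset scored on itself just to show the mechanics.
import torch
from transformers import AutoModel, AutoTokenizer
from sklearn.linear_model import LogisticRegression

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModel.from_pretrained("gpt2")
model.eval()

# Toy data: label 0 = singular subject, 1 = plural subject.
sentences = [
    ("The dog runs across the field", 0),
    ("The dogs run across the field", 1),
    ("A child plays in the park", 0),
    ("The children play in the park", 1),
    ("The key opens the door", 0),
    ("The keys open the door", 1),
]

LAYER = 6  # arbitrary middle layer; real studies sweep every layer

features, labels = [], []
for text, label in sentences:
    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs, output_hidden_states=True)
    # Mean-pool the chosen layer into one vector per sentence.
    vec = outputs.hidden_states[LAYER].mean(dim=1).squeeze(0)
    features.append(vec.numpy())
    labels.append(label)

# The "X-ray": a linear probe over the model's internal states.
probe = LogisticRegression(max_iter=1000).fit(features, labels)
print("probe accuracy:", probe.score(features, labels))
```

A real study would test the probe on held-out sentences and compare every layer, but the mechanics are the same: the gears stay hidden, yet their positions can still be measured from outside.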
4. The "Stuck in the Past" Objection
The Objection:
Critics say, "The AI is just a mirror. It only knows what it read in its training data. It's just a compressed version of the internet. It's not modeling 'Language'; it's modeling 'The Internet'."
The Author's Reply:
Imagine you are teaching a student to write a story.
- The Bad Student: Memorizes the book word-for-word. If you ask them to write a new story, they can't. They just recite the old one. This is overfitting (memorizing the data).
- The Good Student: Reads the book, understands the patterns of storytelling, and then writes a new story they've never seen before.
The author argues that modern AI is the Good Student.
- It is trained to predict the next word, but it is also tested on whether it handles logic, grammar, and nuance (using benchmarks like GLUE, which includes tasks such as entailment and grammatical acceptability).
- If the AI were just memorizing the internet, it would fail on the new, unseen sentences these tests throw at it.
- Because it passes these tests, we have good evidence that it has learned the underlying rules (the conventions) of the language, not just the specific words in the book. (There's a short sketch of this kind of test below, after the chef analogy.)
It's like a chef who tastes a million recipes. They don't just memorize the recipes; they learn the principles of cooking (heat, salt, texture). Now, they can cook a dish they've never seen before. The AI has learned the "principles" of English.
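Here is the promised sketch of a generalization test. The idea: hand the model freshly invented minimal pairs, identical except for subject-verb agreement, and check which version it finds more probable. Because the sentences are made up on the spot, reciting memorized text can't help. The specifics are again assumptions for illustration only: GPT-2 via Hugging Face transformers, scored by average next-word prediction loss. This mirrors the logic of acceptability benchmarks like GLUE's CoLA task; it is not an experiment from the paper.

```python
# Does the model prefer grammatical sentences it has never seen?
# Score each sentence by its average next-token loss: lower loss means
# the model finds the sentence more probable.
# Assumptions (illustrative): GPT-2; invented sentence pairs.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def avg_loss(text: str) -> float:
    """Average next-word prediction loss for one sentence."""
    ids = tokenizer(text, return_tensors="pt").input_ids
    with torch.no_grad():
        out = model(ids, labels=ids)  # standard language-model loss
    return out.loss.item()

# Minimal pairs: identical except for subject-verb agreement.
pairs = [
    ("The keys to the old wooden cabinet are missing",
     "The keys to the old wooden cabinet is missing"),
    ("The authors that the critic admires write beautifully",
     "The authors that the critic admires writes beautifully"),
]

for good, bad in pairs:
    print(f"{avg_loss(good):.3f}  (grammatical, expected lower)")
    print(f"{avg_loss(bad):.3f}  (ungrammatical)")
```

If the model were only reciting its training data, both versions would look equally alien to it; a consistent preference for the grammatical one is evidence that it has absorbed the convention, the way the chef absorbed the principles of cooking.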
The Bottom Line
The paper is an invitation to change our perspective.
- Don't ask: "Does this AI think like a human?" (The answer is probably no).
- Do ask: "Does this AI act like a good map of how humans use language together?" (The author argues the answer is yes).
By treating AI as a scientific model of public language, we can use it to discover new things about how our language works, how it changes, and how we communicate, without needing to solve the mystery of the human brain first. It turns the AI from a "parrot" into a powerful laboratory for language.