Simon's model does not produce Zipf's law: The fundamental rich-get-richer mechanism for any power-law size ranking

This paper corrects a fundamental flaw in Herbert Simon's 1955 model by demonstrating that a time-dependent innovation rate decaying as the inverse of the logarithm of the number of types is required to generate power-law size rankings, including Zipf's law, whereas Simon's original approach fails to do so.

Pablo Rosillo-Rodes, Julia Witte Zimmerman, Laurent Hébert-Dufresne, Peter Sheridan Dodds

Published 2026-04-16
📖 5 min read🧠 Deep dive

Imagine a bustling marketplace where new stalls open every day. Some stalls sell popular items (like "Apple" or "The"), while others sell niche goods (like "Zebra" or "Xylophone").

In this market, there is a famous rule called Zipf's Law. It says that the most popular item is twice as popular as the second most popular, three times as popular as the third, and so on. It's a pattern seen everywhere: in how often we use words, how big cities are, and how many people follow celebrities.

For decades, scientists thought they knew why this happened. They pointed to a model created by a man named Herbert Simon in 1955.

The Old Story: The "Rich-Get-Richer" Game

Simon's idea was simple and intuitive: The rich get richer.

  • Imagine a new word enters the market. It has a small chance of being a brand new invention (innovation).
  • But most of the time, people just copy what's already popular. If "Apple" is already the biggest seller, it's more likely to get the next customer than a tiny stall selling "Zebra."
  • Simon thought that if you tweak the "innovation rate" (how often new things appear), you could get any pattern you want, including the perfect Zipf's Law.

The Big Mistake: The "Winner-Takes-All" Crash

The new paper by Rosillo-Rodes and his team says: Simon's model is broken.

Here is the problem with Simon's logic, explained with a metaphor:
Imagine a race where the person in the lead gets a bigger and bigger head start every time a new runner joins.

  • If the "innovation rate" (the chance of a new runner starting) gets very, very low, Simon's math suggested the race would still look like a normal Zipf's Law.
  • Reality: If innovation stops almost completely, the first runner (the very first word or city) just keeps running forever while everyone else stays behind. The leader becomes a giant, and everyone else is tiny. The pattern breaks. Instead of a smooth curve, you get a "Winner-Takes-All" disaster where one thing dominates everything.

Simon's model fails exactly when it's supposed to work best: when trying to explain Zipf's Law.

The New Solution: The "Smart Pacing" Mechanism

The authors didn't just find the error; they fixed the engine. They discovered that for the "Rich-Get-Richer" mechanism to work perfectly and create Zipf's Law, the rate of innovation cannot be a fixed number. It must change over time.

Think of it like a conductor leading an orchestra:

  • Early on: The conductor needs to introduce many new instruments (new types) quickly to build the band.
  • Later on: As the orchestra gets huge, the conductor must slow down the introduction of new instruments. But here is the trick: they can't stop completely, or the first instrument will drown out the rest.
  • The Magic Formula: The paper proves that to get the perfect Zipf's Law, the rate of new inventions must slow down very specifically. It must slow down at the same speed as the logarithm of the number of things already in the system.

The Analogy:
Imagine you are filling a bathtub with water (tokens) and adding new colors (types).

  • Simon's way: You keep the faucet on a steady drip for new colors. Eventually, the first color you poured in turns into a massive, overwhelming ocean of blue, and the other colors are just a few drops.
  • The New Way: You have a smart faucet. As the tub fills with more colors, the faucet automatically slows down the rate at which you add new colors, but it does so in a very precise way (slowing down like 1/lnN1/\ln N). This ensures that the first color doesn't take over, and the water levels settle into that perfect, natural Zipf's Law curve.

Why This Matters

  1. It Fixes the Theory: For the first time, we have a working "Rich-Get-Richer" model that actually produces Zipf's Law without breaking.
  2. It Works in Real Life: The authors tested their new model against famous books (like Frankenstein, Don Quixote, and Harry Potter). Simon's model failed to match the word counts in these books, but the new "Smart Pacing" model got it right every time.
  3. Universal Rule: This isn't just about words. Whether it's cities, companies, or species, if a system follows a power-law ranking, it is likely following this specific "Smart Pacing" rule of innovation.

The Takeaway

The universe loves patterns, but the old explanation for why we have these patterns was slightly off. The new paper shows that for the "Rich-Get-Richer" effect to create a balanced, natural world (Zipf's Law), the system must be incredibly smart about when to introduce new things. It's not just about being rich; it's about knowing exactly when to stop adding new players so the game stays fair.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →