WELLDOC property for words generated by morphisms

This paper investigates the WELLDOC property, an abelian-type characteristic of infinite words concerning the distribution of Parikh vectors of prefixes, and establishes a criterion for determining whether words generated by morphisms possess this property.

Svetlana Puzynina, Vladimir Schavelev

Published Tue, 10 Ma
📖 5 min read🧠 Deep dive

Imagine you are baking a giant, infinite loaf of bread. This isn't just any bread; it's made by a magical recipe (a morphism) that takes a small piece of dough, stretches it out, and replaces every crumb with a specific pattern of new crumbs. You keep doing this forever, creating a never-ending sequence of flavors.

Now, imagine you are a food critic trying to taste every possible combination of flavors in this infinite loaf. You want to know: Is the bread "fairly" distributed?

This paper is about a specific kind of fairness called WELLDOC (Well Distributed Occurrences).

The Big Idea: The "Fairness" Test

Let's say you are looking for a specific flavor combination, like "Chocolate-Chip" (a factor). You want to find every time "Chocolate-Chip" appears in the infinite loaf.

The WELLDOC property asks a very strict question:

"No matter what 'flavor profile' (a specific count of ingredients) you want to see before the Chocolate-Chip appears, can you find a spot in the bread where that profile exists?"

To make this concrete, imagine the bread is made of three ingredients: Flour, Sugar, and Salt.

  • The Parikh Vector: This is just a tally sheet. If you look at the first 100 crumbs, how many are Flour? How many are Sugar? How many are Salt? That's your vector.
  • The Modulo Test: The paper asks: "If I want to find a spot where, before the Chocolate-Chip, I have seen exactly 3 more Sugars than Flours (modulo 5), can I find it?"

If the answer is "Yes" for every possible ingredient count and every possible number you choose, the bread has the WELLDOC property. It means the ingredients are scattered so perfectly that you can find any statistical pattern you want before any specific flavor appears.

The Problem: The "Lattice" Trap

Why do we care? The authors explain that this concept comes from pseudorandom number generators (the math behind computer randomness).

Imagine a machine that generates numbers. Sometimes, these machines have a hidden flaw called a "lattice structure." It's like if you tried to scatter marbles on a floor, but they accidentally lined up in perfect, invisible rows and columns. If you look at the floor from a certain angle, you see gaps where no marbles exist. A truly random (or "well-distributed") floor should have marbles everywhere, with no predictable gaps.

The authors found that if a word (or a sequence of numbers) has the WELLDOC property, it has no lattice structure. It is perfectly "messy" in a good way.

The Solution: The Magic Recipe (Morphisms)

The paper focuses on words generated by morphisms (the magical recipes). The big question was: How do we know if a specific recipe will produce a perfectly fair loaf?

The authors discovered a simple "litmus test" based on the recipe's Matrix (a grid of numbers that describes how the recipe transforms ingredients).

1. The Binary Case (Two Ingredients: 0 and 1)

If your bread only has two ingredients (like a simple binary code), the test is incredibly simple:

  • Look at the recipe's matrix.
  • Calculate its Determinant (a single number that tells you how the recipe stretches or shrinks space).
  • The Rule: If the determinant is 1 or -1, the bread is perfectly fair (WELLDOC). If it's anything else (like 2, 3, or 4), the bread has "gaps" (lattice structure) and fails the test.

Analogy: Think of the determinant as the "stretch factor." If you stretch a rubber sheet by a factor of 2, you leave empty spaces. If you stretch it by exactly 1 (or flip it, -1), you cover every inch perfectly without gaps.

2. The Complex Case (Three or More Ingredients)

If you have 3, 4, or more ingredients, the "Determinant = 1" rule is necessary but not enough.

  • Condition A: The determinant must still be 1 or -1.
  • Condition B: You must also check the "Return Words."
    • What is a Return Word? Imagine you are walking through the bread. Every time you see the first ingredient (say, "0"), you start counting. The "Return Word" is the path you take until you see "0" again.
    • The Rule: The "paths" (vectors) you take between every "0" must be able to build any possible combination of ingredients. If your paths are too repetitive or restricted, you'll miss some flavor profiles, and the bread fails.

Why This Matters

The authors didn't just find a rule; they proved it works both ways (if the rule holds, the property holds; if the property holds, the rule must hold).

They also showed that for complex recipes, you can algorithmically check if the "Return Words" are good enough. This means a computer can easily tell you if a specific mathematical recipe will generate a "perfectly random" sequence.

The Takeaway

In simple terms, this paper gives us a cheat sheet for creating perfect randomness using simple, repeating rules.

  • The Goal: Create a sequence where every pattern appears with every possible statistical background.
  • The Tool: A mathematical recipe (morphism).
  • The Test:
    1. Check the recipe's "stretch factor" (Determinant). It must be 1.
    2. (For complex recipes) Check if the "loops" between the start of the sequence can reach every corner of the ingredient space.

If you pass the test, you have a sequence that is mathematically "fair," with no hidden lattice structures, making it ideal for generating high-quality random numbers for cryptography, simulations, and computer science.