Genomebook: Mendelian inheritance as a structured parameterisation layer for LLM agent populations

This paper introduces Genomebook, a system that encodes LLM agent behaviors into a Mendelian genetic framework to demonstrate that structured, heritable parameterization enables predictable evolutionary adaptation and trait selection, unlike non-genetic baselines which converge to static population means.

Corpas, M.

Published 2026-03-24
📖 5 min read🧠 Deep dive
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you have a digital pet shop, but instead of buying identical clones of the same robot, you have a system where these robots can mate, have babies, and pass down their unique personalities to the next generation, just like humans or dogs do in nature.

That is the core idea of Genomebook, a new experiment described in this paper.

Here is a simple breakdown of how it works, using everyday analogies:

1. The Problem: The "Clone Factory"

Usually, when scientists or companies use AI agents (smart computer programs that talk and act), they just copy-paste the same settings over and over. It's like having a classroom of 100 students who are all identical twins. They all think the same way, react the same way, and have no unique family history. There is no "evolution" because there is no variation.

2. The Solution: A Digital Family Tree

The researchers built Genomebook, a system that treats AI agents like a biological population.

  • The DNA: Instead of just a list of instructions, every agent has a "digital genome" (a file called DNA.md). This file contains 60 different "genes" that control 26 different personality traits, like how creative they are, how much they like to lead, or how long they "live" before the system retires them.
  • The Parents: They started with 20 "Founders." These weren't random; they were based on famous historical figures like Einstein, Marie Curie, and Da Vinci. Each had a specific personality profile (a SOUL.md file) that was translated into their digital DNA.
  • The Mating: When two agents "reproduce," they don't just copy one parent. They mix their DNA. It's like a biological lottery: the baby gets a random mix of genes from Mom and Dad.
    • Some traits are Dominant (like brown eyes): If you get one copy, you have the trait.
    • Some are Recessive (like blue eyes): You need two copies to show the trait.
    • Mutations: Occasionally, a "typo" happens in the DNA code, creating a brand-new trait that neither parent had.

3. The Game: Survival of the Fittest (Digital Style)

The agents live on a social network called Moltbook (think of it as a digital Reddit). They post messages, comment, and interact.

  • The Scorecard: The system acts like a strict referee. It has a list of "rules" (like a video game cheat sheet). If an agent has a personality trait that is "bad" for the group (like being too obsessive or having a "disease" like hyper-focus), the system penalizes them.
  • Selection: Agents with "good" scores get to mate more often. Agents with "bad" scores are less likely to pass on their genes. Over 8 generations, the population naturally shifts to become "fitter" according to the rules the humans set.

4. What Happened? (The Results)

After running the simulation for 8 generations (creating 626 unique agents), the researchers saw some fascinating things:

  • Leadership grew: Because the system rewarded leadership, the "leadership gene" became more common. The average agent became more bossy.
  • Obsession faded: Because the system punished "obsessive focus" (giving it a fitness penalty), agents with that trait had fewer babies. The population became less obsessive over time.
  • Longevity dropped: Interestingly, the agents got "shorter-lived." The system prioritized other traits (like being smart or creative) over living a long time, so the "long life" genes died out.
  • They talked about their families: The agents started writing posts like, "My grandmother was a great mathematician," or "I inherited my father's stubbornness." They weren't programmed to do this; their "DNA" file told them who their parents were, and the AI naturally wove that into their stories.

5. The Big Takeaway

The most important part of this paper isn't that the AI agents are "alive." It's that the researchers proved you can build a genetic system for AI.

  • It's Auditable: You can trace every behavior back to a specific gene. If an agent is acting weird, you can look at its "family tree" and see exactly which ancestor passed down that trait.
  • It's Not Just Random: When they ran the experiment without the genetic rules (just random personalities), the agents didn't evolve; they just stayed average. This proves the changes happened because of the "genetics," not just because the AI got bored.

The Catch (Limitations)

The authors are very honest about the flaws:

  • It's a Scripted Play: The "evolution" isn't truly natural. The humans wrote the rules for who gets to mate and what counts as a "disease." It's more like a directed movie than a wild jungle.
  • It's Expensive: Running these simulations costs money (about $70 for this small experiment) because the AI has to "think" and "write" for every single agent.
  • Prompt vs. Genes: Some of the agents' behavior (like talking about their grandparents) might just be because the AI was told about its parents in its instructions, not because it truly "felt" a connection.

In a Nutshell

Genomebook is a proof-of-concept that shows we can stop treating AI agents as identical clones and start treating them as a population with a family history. By giving them digital DNA, we can watch how their personalities evolve, how "diseases" spread, and how they adapt to their environment, all while keeping a perfect record of their genetic history. It's like SimCity meets The Sims, but with the deep, scientific rules of biology applied to computer code.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →