Agentic AI-assisted coding offers a unique opportunity to instill epistemic grounding during software development

This paper proposes "GROUNDING.md," a community-governed document that encodes non-negotiable scientific constraints and conventions to ensure agentic AI-assisted coding produces valid, best-practice software even when used by non-domain experts.

Original authors: Magnus Palmblad, Jared M. Ragland, Benjamin A. Neely

Published 2026-04-24
📖 4 min read☕ Coffee break read

Original authors: Magnus Palmblad, Jared M. Ragland, Benjamin A. Neely

Original paper dedicated to the public domain under CC0 1.0 (http://creativecommons.org/publicdomain/zero/1.0/). ⚕️ This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you've just hired a brilliant, hyper-fast, but slightly reckless apprentice to build you a custom house. This apprentice (the Agentic AI) can read your instructions and start laying bricks, wiring electricity, and painting walls at lightning speed. This is what the authors call "vibe coding": you tell the AI what you want ("I need a house with a blue door and a pool"), and it just goes for it.

The problem? The apprentice is incredibly talented at building, but they don't know the laws of physics or building codes. If you ask for a pool on the roof, they might build it exactly as requested, only for the whole house to collapse because gravity wasn't considered. In science, this is dangerous. If an AI writes code to analyze medical data but ignores the rules of statistics, it might produce a "cure" that doesn't actually work.

The Solution: The "Constitution" for AI

The authors propose a new file called GROUNDING.md. Think of this not as a to-do list, but as the Constitution or the Building Code for your specific field (in this case, proteomics, which is the study of proteins).

Here is how it works, using simple analogies:

1. The Hierarchy of Instructions

Usually, when you talk to an AI, you give it a plan. The authors suggest a hierarchy of documents, like a set of Russian nesting dolls:

  • plan.md (The To-Do List): "Build a house with a blue door." (This is temporary and changes every time).
  • AGENTS.md (The Project Rules): "Use red bricks and paint the trim white." (Specific to this job).
  • SKILL.md (The Toolbox): "Here is how to install a window." (General techniques).
  • GROUNDING.md (The Constitution): "NO HOUSE CAN BE BUILT WITHOUT A FOUNDATION."

The GROUNDING.md sits at the very bottom, the deepest layer. It is the Field-Scoped Epistemic Grounding. "Epistemic" is a fancy word for "knowledge about what is true." This document contains the non-negotiable truths of the scientific field.

2. Hard Constraints vs. Convention Parameters

The paper divides the rules in this "Constitution" into two types:

  • Hard Constraints (HCs): These are the Red Lines. They are like the laws of physics.
    • Example: "You cannot calculate the probability of a protein match without using a specific safety check (False Discovery Rate)."
    • Analogy: If the AI tries to build a wall without a foundation, the GROUNDING.md slams the brakes. It says, "STOP. This violates the laws of science. I will not build this." It overrides whatever the user asked for.
  • Convention Parameters (CPs): These are the Community Preferences.
    • Example: "We usually use blue paint for the front door, but red is okay if you have a good reason."
    • Analogy: If the AI uses red paint, it gets a gentle tap on the wrist: "Hey, we usually do blue, but okay, I'll note that." It warns the user but doesn't stop the work.

3. Why Do We Need This?

Currently, if a non-expert (a "research programmer") asks an AI to write complex scientific software, the AI might invent a new way to do things that looks cool but is scientifically wrong. It's like the apprentice inventing a new type of cement that dissolves in rain.

The GROUNDING.md ensures that even if the person asking for the software doesn't know the deep science, the software itself knows the rules. It acts as a guardrail.

  • Without it: The AI optimizes for "what the user wants" (e.g., "Make it fast!"), potentially breaking the science.
  • With it: The AI optimizes for "what is scientifically valid," even if the user didn't ask for it.

The Big Picture

The authors are saying: "We are entering an era where AI can write almost any code. But if we don't give the AI a 'Constitution' that encodes the hard-won wisdom of our scientific community, we will end up with a lot of fast, broken software."

By creating a GROUNDING.md file, the scientific community can say to the AI: "You are the genius builder, but you must follow our rules. If you try to break the rules, you must stop and ask for help."

This allows non-experts to build powerful, custom scientific tools with the confidence that the "foundation" is solid, ensuring that the final product is trustworthy, reproducible, and actually works in the real world. It turns the AI from a wild genius into a disciplined, rule-following master craftsman.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →