Lifetime Sample Tracking (LiST): A Data Platform for Materials Science

The Lifetime Sample Tracking (LiST) platform is a comprehensive data management system developed by the 2DCC-MIP to automate the curation, analysis, and dissemination of diverse materials science data, thereby enabling closed-loop synthesis-design iterations and supporting machine learning research for approximately twenty thousand samples.

Original authors: Anthony Richardella, Isaiah A Moses, Konrad Hilse, Frank Santaguida, Kevin Dressler, Ric Wilburn, Saiyyam Kochar, Wesley F. Reinhart, Adri C. T. van Duin, Nitin Samarth, Vincent H. Crespi, Joan M. Red
Published 2026-06-17
📖 5 min read🧠 Deep dive

Original authors: Anthony Richardella, Isaiah A Moses, Konrad Hilse, Frank Santaguida, Kevin Dressler, Ric Wilburn, Saiyyam Kochar, Wesley F. Reinhart, Adri C. T. van Duin, Nitin Samarth, Vincent H. Crespi, Joan M. Redwing

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine a massive, bustling kitchen where thousands of chefs are trying to invent the perfect new recipe for a dish called "2D Materials." These aren't just any dishes; they are ultra-thin layers of atoms used in advanced electronics and science.

The problem? Every chef (researcher) has their own way of cooking. Some use a high-tech oven (MOCVD), others use a vacuum chamber (MBE), and some bake them on different types of pans (substrates). In the past, if a chef made a great dish, they wrote down the recipe in a fancy notebook and published it. But if they burned the toast or made a bland soup, they threw it in the trash and never mentioned it. This made it hard for other chefs to learn what not to do, and it was impossible to know exactly how a specific dish was made years later.

Enter LiST: The Ultimate Recipe Book and Kitchen Manager

The paper introduces LiST (Lifetime Sample Tracking), a digital platform built by the 2D Crystal Consortium (2DCC-MIP) at Penn State. Think of LiST as a super-smart, automated kitchen manager that solves the chaos of the experimental kitchen.

Here is how it works, broken down into simple parts:

1. The "Black Box" Recorder (Automated Data Capture)

In a normal kitchen, a chef might forget to write down that they added a pinch of salt at 2:00 PM. In the LiST kitchen, the ovens and mixers are connected to a computer.

  • How it works: When a scientist starts growing a material, a "Gatekeeper" computer automatically grabs the recipe, the temperature, the timing, and the specific ingredients from the machine. It doesn't rely on human memory.
  • The Analogy: It's like a smart fridge that automatically logs every time you open the door, what you put inside, and exactly how long it stayed there, so you never have to guess later.

2. The "Full Life Story" (Tracking the Sample)

LiST doesn't just care about the final dish; it cares about the entire life of the ingredient.

  • The Analogy: Imagine tracking a tomato from the moment it was a seed, through the soil it grew in, the truck that carried it, the washing it received, and finally how it was chopped. LiST does this for atoms. It tracks the "substrate" (the base material) just as carefully as the final product.
  • Why it matters: If a material has a weird property, scientists can look back in LiST to see if it was caused by the raw ingredients, the oven temperature, or a scratch on the pan.

3. The "Good and Bad" Library (FAIR Data)

Most science books only show the "Five-Star Reviews" (successful experiments). LiST is different. It stores everything, including the "burnt toast" (failed experiments).

  • The Analogy: If you are trying to learn to bake, knowing that "adding too much flour at 300 degrees causes a fire" is just as valuable as knowing the perfect recipe. LiST saves these negative results so computers can learn from them.
  • The Result: This creates a massive library of "Findable, Accessible, Interoperable, and Reusable" (FAIR) data.

4. The "Digital Passport" (DOIs and Data Packages)

When a scientist publishes a paper, they usually just say, "We made a great sample." LiST gives that sample a Digital Passport (DOI).

  • The Analogy: Instead of just saying "I baked a cake," the scientist hands you a QR code. When you scan it, you see the exact recipe, the brand of flour used, the oven model, and a photo of the cake.
  • The Benefit: This makes science transparent. Anyone can verify the work, and if they want to try it themselves, they have the exact instructions.

5. The "Crystal Ball" (Machine Learning)

Because LiST has so much data (about 20,000 samples and thousands of failed attempts), it is perfect for teaching Artificial Intelligence (AI).

  • The Analogy: Imagine showing a robot chef 10,000 photos of cakes, including the ones that collapsed. The robot learns to predict exactly how to bake a perfect cake by spotting patterns humans might miss.
  • What the paper found: The researchers used LiST data to train AI to:
    • Look at a photo of a material and guess what temperature it was baked at.
    • Predict how much of a surface is covered by crystals just by looking at a picture.
    • Even guess what a material sounds like (spectroscopy) just by looking at its texture (microscopy).

Summary

The paper describes LiST not as a magic wand that invents new materials instantly, but as a powerful organizational tool. It turns the chaotic, messy process of experimental science into a clean, digital, and searchable history. By recording every step, saving every failure, and connecting the data to the final publication, LiST helps scientists move from "guessing and hoping" to "designing and knowing."

It is currently being used by the 2DCC and is expanding to help other research centers, acting as a bridge between the messy reality of the lab and the clean, organized world of data science.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →