Systematic Fine-Tuning of MACE Interatomic Potentials for Catalysis

This paper systematically evaluates nine MACE-based machine-learned interatomic potentials, demonstrating that while from-scratch models require specific high-energy training configurations to reduce errors, fine-tuning large foundation models offers superior transferability and accuracy across diverse catalytic reactions and out-of-distribution scenarios.

Original authors: Nima Karimitari, Jacob Clary, Derek Vigil-Fowler, Ravishankar Sundararaman, Gábor Csányi, Christopher Sutton

Published 2026-05-12
📖 5 min read🧠 Deep dive

Original authors: Nima Karimitari, Jacob Clary, Derek Vigil-Fowler, Ravishankar Sundararaman, Gábor Csányi, Christopher Sutton

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine you are trying to teach a computer to predict how chemicals react on a catalyst (a material that speeds up reactions, like a spark plug for a car engine). To do this, the computer needs a "map" of the energy landscape, showing where the hills (barriers to reaction) and valleys (stable states) are.

Traditionally, drawing this map requires incredibly slow and expensive supercomputer calculations (called DFT). Machine Learning Interatomic Potentials (MLIPs) are like a shortcut: they are smart AI models that learn to draw this map almost instantly, with near-perfect accuracy.

This paper is a guide on how to train these AI models most effectively. The authors tested two main ways to teach the AI: starting from zero ("From-Scratch") and giving it a head start with a pre-trained "foundation" model ("Fine-Tuning").

Here is the breakdown of their findings using simple analogies:

1. The Two Training Strategies

Strategy A: From-Scratch (FS) – "The Blank Slate"
Imagine trying to teach a student to navigate a city by only showing them the main streets where people live (relaxed, stable structures).

  • The Problem: If you only show them the main streets, they get lost when they encounter a construction zone or a detour (high-energy, unstable states where bonds are breaking).
  • The Fix: The authors found that for these "blank slate" models, you must show them the "construction zones." By adding data from simulations that shake the atoms around (Molecular Dynamics) or force them to stretch until they almost break (Contour Exploration), the model learns the tricky parts of the map.
  • Result: Without these "chaos" examples, the model makes big mistakes. With them, the error drops by more than half.

Strategy B: Fine-Tuning (FT) – "The Expert Intern"
Imagine hiring a student who has already graduated from a top university with a degree in chemistry (a pre-trained "Foundation Model" called MACE-MH-1). They already know the general layout of the world.

  • The Advantage: You don't need to show them every single street or every construction zone. You just need to give them a specific handbook for the neighborhood you care about (e.g., metal catalysts or metal oxides).
  • The Result: These "interns" are much more robust. Even if you only give them a small amount of specific data, they perform better than the "blank slate" students, even on reactions they haven't seen before (out-of-distribution). They are less sensitive to how you collect the data.

2. The "Cross-Training" Surprise

One of the coolest findings is that these models can learn from one type of material and apply it to another.

  • The Analogy: It's like teaching a chef how to cook steak (metal catalysts) and then asking them to cook a vegetable dish (metal-oxide catalysts).
  • The Finding: The authors found that if you fine-tune the model on metal data, it surprisingly gets really good at predicting reactions on metal oxides, even if it never saw metal oxides in its specific training set. Conversely, training on metal oxides helped it predict reactions on metals.
  • Why? The model learned the fundamental "physics" of how atoms bond and break, which applies to both materials.

3. The "Super-Model" and the Big Screen

The authors combined all their best data to create one "Super-Model" (FT-All).

  • The Test: They used this model to screen a massive library of 90,781 different chemical combinations (binary alloys) to see which ones might be good catalysts.
  • The Outcome: The model was incredibly accurate, with an error rate of just 0.15 eV (a very small margin of error in this field). It successfully predicted how chemicals would stick to surfaces it had never seen before, including complex, jagged surfaces (high Miller index surfaces) that are hard to model.

4. Why This Matters for Real Reactions

The paper tested these models on real-world, difficult chemical reactions:

  • CO2 Reduction: Turning carbon dioxide into useful fuels (like ethylene or ethanol).
  • Oxygen Evolution: A key step in making clean energy (splitting water).
  • Propane Dehydrogenation: Making plastics.

In almost every case, the Fine-Tuned models outperformed the From-Scratch models. They were faster to train, required less data, and were more accurate at predicting the "energy barriers" (the height of the hill the reaction has to climb).

The Bottom Line

If you want to build a machine learning model for catalysis:

  1. Don't start from scratch unless you have a massive, chaotic dataset full of broken bonds and high-energy states.
  2. Start with a pre-trained foundation model and "fine-tune" it with a smaller, targeted dataset. It's like giving a smart intern a specific project rather than hiring a novice and teaching them everything from day one.
  3. Don't worry too much about perfect data diversity if you are fine-tuning; the foundation model has already learned the general rules of the universe.

This work provides a roadmap for scientists to build better, faster, and more accurate AI tools to discover new catalysts for clean energy and sustainable chemistry.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →