Siamese Foundation Models for Crystal Structure Prediction

The paper introduces Diffusion-based Crystal Omni (DAO), a pretrain-finetune framework utilizing Siamese foundation models that significantly outperforms conventional methods in predicting crystal structures, achieving high accuracy on real-world superconductors while operating over 2,000 times faster than DFT-based approaches.

Original authors: Liming Wu, Wenbing Huang, Rui Jiao, Jianxing Huang, Liwei Liu, Yipeng Zhou, Hao Sun, Yang Liu, Fuchun Sun, Yuxiang Ren, Jirong Wen

Published 2026-04-15
📖 5 min read🧠 Deep dive

This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine you are a master architect trying to design a new, super-strong building. You have a list of materials (like steel, glass, and concrete), but you don't know how to arrange them to make the building stand up without collapsing. In the world of science, this is called Crystal Structure Prediction (CSP). Scientists want to know: "If I mix these specific atoms together, what 3D shape will they naturally form to be the most stable?"

For decades, solving this puzzle has been like trying to find a needle in a haystack while blindfolded. Traditional methods are slow, expensive, and often get stuck in dead ends.

This paper introduces a new AI system called DAO (Diffusion-based Crystal Omni) that acts like a "super-architect" team. Here is how it works, using simple analogies:

1. The Two-Person Dream Team (Siamese Models)

Instead of one AI trying to do everything, the authors created two AI models that work together like a Builder and a Safety Inspector.

  • The Builder (DAO-G): This AI is the creative one. Its job is to imagine and draw thousands of different 3D structures based on a list of ingredients (chemical composition). It uses a technique called "diffusion," which is like starting with a cloud of static noise and slowly clearing it away until a clear crystal shape emerges.
  • The Safety Inspector (DAO-P): This AI is the expert on physics. It doesn't draw buildings; it checks them. It looks at a structure and says, "That looks wobbly; it will collapse," or "That looks solid and stable." It predicts the energy of the structure. In physics, lower energy means a more stable, happy crystal.

The Magic Trick: They are "Siamese" twins, meaning they share the same brain architecture. They talk to each other constantly. While the Builder is drawing, the Inspector whispers, "Hey, that corner looks unstable, try moving that atom over there." This feedback loop helps the Builder create better designs much faster.

2. Training on "Mistakes" (The Two-Stage Pretraining)

Usually, AI models are only trained on perfect examples (stable crystals). But this paper argues that's like teaching a pilot only on smooth flights; they won't know how to handle turbulence.

  • Stage 1: The Builder is trained on a massive library of 940,000 crystal structures. Crucially, this library includes both perfect crystals and broken, unstable ones. The AI learns what not to build.
  • Stage 2 (The Fix-It Phase): The Safety Inspector steps in. It takes the "broken" crystals from the library and uses its physics knowledge to "relax" them—essentially fixing the wobbly parts to make them stable. The Builder then retrains on these "fixed" versions.

Analogy: Imagine learning to cook. First, you taste a million dishes, including burnt ones and raw ones (Stage 1). Then, a master chef takes the burnt ones, fixes them, and shows you the corrected version (Stage 2). Now, you know exactly how to avoid the mistakes and how to fix them if they happen.

3. The "Energy Compass" (Energy-Guided Sampling)

When the Builder is generating a new crystal, it doesn't just guess randomly. It uses the Safety Inspector as a compass.

In the real world, atoms naturally want to settle into the lowest energy state (like a ball rolling to the bottom of a hill). The AI uses the Inspector to constantly nudge the ball down the hill. If the Builder tries to create a structure that is too "high energy" (unstable), the Inspector pushes it back toward a stable shape. This ensures the final result is not just a random shape, but a physically possible, stable crystal.

4. The Real-World Test: Superconductors

To prove this isn't just a video game, the team tested DAO on superconductors—materials that conduct electricity with zero resistance, which are incredibly hard to design.

  • The Challenge: They picked three real-world superconductors that the AI had never seen before.
  • The Result:
    • Accuracy: For one of them (Cr6Os2), the AI got a 100% match with the real experimental structure. It was almost perfect.
    • Speed: This is the biggest win. Traditional methods (using supercomputers to simulate physics) take 2,000 times longer to find the structure than this AI. The AI did in 1.5 minutes what used to take hours or days.
    • Prediction: The Safety Inspector also predicted the "critical temperature" (how cold it needs to be to work) with incredible accuracy, almost matching real-world measurements.

Why This Matters

Think of materials science as trying to discover new medicines or better batteries. Currently, scientists are like people searching for a needle in a haystack by hand.

DAO is like a metal detector.
It doesn't just find the needle; it tells you exactly where it is, what it looks like, and how strong it is, all in a fraction of the time. By combining a creative generator with a physics-aware inspector, and training them on a massive, diverse dataset, this system opens the door to discovering new materials for clean energy, quantum computing, and advanced electronics at a speed we've never seen before.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →