SEISMO: Increasing Sample Efficiency in Molecular Optimization with a Trajectory-Aware LLM Agent

The paper introduces SEISMO, a trajectory-aware LLM agent that significantly improves sample efficiency in molecular optimization by performing strictly online updates conditioned on full optimization trajectories and explanatory feedback, achieving superior performance across 23 benchmark tasks with far fewer oracle calls than prior methods.

Original authors: Fabian P. Krüger, Andrea Hunklinger, Adrian Wolny, Tim J. Adler, Igor Tetko, Santiago David Villalba

Published 2026-02-19
📖 4 min read☕ Coffee break read

Original authors: Fabian P. Krüger, Andrea Hunklinger, Adrian Wolny, Tim J. Adler, Igor Tetko, Santiago David Villalba

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). ⚕️ This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you are a master chef trying to invent the perfect new recipe for a dish that needs to be delicious, healthy, and cheap to make. However, there's a catch: tasting the dish is incredibly expensive and slow. You can only afford to taste it 50 times before your budget runs out.

Most computer programs trying to solve this problem act like a blindfolded chef throwing random ingredients into a pot, tasting the result, and hoping for the best. They might taste thousands of bad dishes before finding a good one. This is inefficient and wasteful.

This paper introduces SEISMO, a new "chef" powered by a Large Language Model (an AI that has read almost every cookbook and scientific paper ever written). SEISMO doesn't just guess; it learns from every single taste test and uses its vast knowledge of cooking chemistry to figure out exactly what to change next.

Here is how SEISMO works, broken down into simple concepts:

1. The "Blind" vs. The "Trajectory-Aware" Chef

  • Old Methods (The Blind Chef): Imagine a chef who tastes a dish, gets a score of "6/10," and then forgets the recipe. They try a completely new, random recipe next time. They don't remember why it was a 6 or what specific ingredient caused the problem. They are like a gambler rolling dice.
  • SEISMO (The Trajectory-Aware Chef): SEISMO is different. It keeps a detailed diary of every single attempt. When it tastes a dish and gets a "6," it doesn't just see the number. It reads the "taste notes" (explanations) that say, "Too salty, not enough spice, and the meat is too tough."
    • It uses this diary to say: "Okay, last time I added too much salt. This time, I'll reduce the salt and add more paprika, because I know from my training that paprika pairs well with this meat."
    • It connects the dots between the history of its attempts and the future of its guesses.

2. The "Oracle" (The Expensive Taste Test)

In the real world, testing a new drug molecule is like that expensive taste test. You have to run complex lab experiments or supercomputer simulations to see if it works. These are called Oracles.

  • Because these tests are so costly, scientists want to find the perfect molecule in as few tests as possible.
  • SEISMO is designed to be ultra-efficient. While other methods might need 1,000 taste tests to find a winner, SEISMO often finds a near-perfect recipe in just 50 tests.

3. The Secret Sauce: "Explanations"

The paper found that just giving the AI a score (like "6/10") isn't enough. The AI needs to know why.

  • Scenario A (No Explanation): The AI gets a score of "6." It guesses it needs to change the salt, but it might actually need to change the heat. It's guessing in the dark.
  • Scenario B (With Explanation): The AI gets a score of "6" and a note saying, "The salt is fine, but the heat was too low, making the meat tough."
    • SEISMO uses this note to make a precise correction.
    • The paper showed that when SEISMO gets these "explanations" (generated by AI tools that analyze why a molecule scored poorly), it becomes much faster at finding the solution. It's like having a sous-chef whispering, "Don't add more salt, turn up the heat!"

4. Why This Matters

Think of drug discovery as searching for a needle in a haystack the size of the Earth.

  • Traditional AI is like a robot that randomly grabs handfuls of hay, checks for a needle, and throws them away. It takes forever.
  • SEISMO is like a robot that has read every book on needles and hay. It knows that needles are usually metallic and sharp. It uses its "common sense" (pre-trained knowledge) to ignore the straw and focus on the metal. When it picks up a piece of hay that almost looks like a needle, it gets a note saying, "It's metal, but it's bent." It then straightens it out immediately.

The Bottom Line

SEISMO is a smart, memory-keeping AI agent that optimizes molecules by talking to itself about its past failures and successes. By combining its vast library of chemical knowledge with detailed feedback from expensive tests, it finds better drugs much faster than previous methods.

In short: It turns the process of drug discovery from a game of "guess and check" into a strategic conversation, saving time, money, and resources in the race to cure diseases.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →