MyeGPT: an AI agent for Multiple Myeloma

Original authors: Chang, J. G., Gout, A. M., Rodiger, J., Chung, T.-H., Mulligan, G., Chng, W. J.

Published 2026-05-23

📖 5 min read🧠 Deep dive

Original authors: Chang, J. G., Gout, A. M., Rodiger, J., Chung, T.-H., Mulligan, G., Chng, W. J.

Original paper licensed under CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/). ⚕️ This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you have a massive, incredibly detailed library of medical records for a specific type of blood cancer called Multiple Myeloma. This library, known as the CoMMpass study, contains the "life stories" of 1,143 patients. It includes their symptoms, treatment history, genetic makeup, and how long they lived. However, there's a catch: this library is written in a secret code (complex computer programming languages) that only a few specialized librarians (bioinformaticians) can read. For a regular doctor or researcher, trying to find a specific answer—like "Do patients with a certain gene mutation live longer?"—is like trying to find a needle in a haystack while wearing blindfolded gloves.

Enter MyeGPT.

What is MyeGPT?

Think of MyeGPT as a super-smart, bilingual librarian who has memorized the entire CoMMpass library. You don't need to know the secret code to talk to it. You can simply ask it in plain English, just like you would ask a human: "Show me the survival rates for patients who relapsed after their first treatment," or "Compare the overall health of patients with high versus normal levels of a specific protein."

MyeGPT doesn't just guess the answer. It acts like a detective that:

Understands your question.
Digs into the database to find the exact pages of data needed.
Does the math to calculate the answer.
Draws a picture (a chart or graph) to show you the results.

How Does It Work?

The researchers built this "librarian" using a type of Artificial Intelligence called Agentic AI. Unlike a standard chatbot that just chats, an "agent" can actually do things.

The Brain (LLM): This is the part that understands your language and plans the steps to solve the problem.
The Tools: MyeGPT has a set of digital tools. It has a "search tool" to find the right definitions in the library's manual, a "database tool" to run complex queries, and a "drawing tool" to create graphs.
The Memory (Knowledge Base): Before it starts, the team fed MyeGPT a 52-page instruction manual that explains exactly what every column in the database means. This ensures it doesn't get confused about what "treatment response" or "copy number" actually refers to.

Did It Work? (The Test Drive)

The researchers didn't just build it; they put it through a rigorous driving test to see if it was safe and accurate.

The Trivia Test: They asked MyeGPT 20 tricky questions that required complex math and data digging. They compared its answers against a "gold standard" created by human experts.
- The Result: The best version of MyeGPT got a score of 79.4 out of 100. It was good enough to be useful, though not perfect. Interestingly, a smaller, cheaper AI model performed almost as well as the giant, expensive ones, making it a cost-effective choice.
The "Replay" Test: They asked MyeGPT to recreate famous studies that had already been published by humans.
- The Result: MyeGPT successfully reproduced the findings of two major studies. It calculated survival rates and gene expression patterns that matched the original human researchers' work almost perfectly.
The "Human vs. Machine" Test: They asked both MyeGPT and human experts to classify patients into risk groups (High Risk vs. Standard Risk).
- The Result: The agreement between the AI and the humans was incredibly high (96.5%). This means the AI "thinks" very similarly to a human expert when looking at this data.

What Can It Do Right Now?

The paper shows MyeGPT handling three types of tasks:

Simple: "How many patients took Drug X?" (One quick look-up).
Medium: "Do patients with Gene Y respond better to Drug Z?" (Comparing two different sets of data).
Complex: "Scan the entire genetic code of all 1,143 patients to find any gene that might be linked to treatment failure." (A massive, genome-wide search).

Important Limitations (The Fine Print)

The authors are very clear about what MyeGPT is not:

It is not a doctor. You cannot ask it, "Should I give this specific patient this specific drug?" The paper explicitly states that MyeGPT is for research and hypothesis generation only. Because AI can make mistakes and the data comes from an observational study (not a controlled clinical trial), its advice should never be used to make real-life medical decisions for patients.
It needs a research setting. It is designed to help scientists explore data on their laptops or even smartphones, but it is not a clinical tool for hospitals yet.

The Bottom Line

MyeGPT is a proof-of-concept tool that bridges the gap between complex, massive medical datasets and the researchers who want to use them. It turns a library that requires a PhD in computer science to navigate into a conversational chat interface. While it's not ready to replace doctors, it acts as a powerful "co-pilot" for scientists, helping them ask questions and find answers in the vast ocean of Multiple Myeloma data much faster than before.