CPLfold: Chimeric and Pseudoknot-capable almost Linear-time RNA Secondary Structure Prediction

CPLfold is a fast and flexible RNA secondary structure prediction method that integrates thermodynamic modeling with chimeric evidence from cross-linking experiments to accurately predict pseudoknots and long-range interactions in long sequences.

Original authors: Wang, K., Kudla, G., Cohen, S. B.

Published 2026-02-14
📖 3 min read☕ Coffee break read
⚕️

This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer

Imagine you have a long, tangled string of yarn. This yarn represents a strand of RNA, a molecule that acts as a messenger and worker inside our cells. To do its job, this string doesn't just hang straight; it folds up into a specific 3D shape, like a complex origami crane. If the shape is wrong, the cell's machinery breaks down.

The Problem: The Tangled Knots
Scientists have been trying to predict how this string folds for years. They use a "rulebook" based on physics (thermodynamics) to guess where the string will loop back and stick to itself. However, there are two big headaches:

  1. The Knots (Pseudoknots): Sometimes the string ties itself into complex knots that cross over themselves. Standard rulebooks get confused by these knots and often give up or make mistakes.
  2. The Long Distance: For very long strings, the computer takes forever to calculate the folds, making it impractical for real-world use.

Scientists have tried to help by using "proximity ligation" experiments. Think of this as taking a photo of the string while it's folding. The photo shows two parts of the string that are touching, but it doesn't tell you exactly how the rest of the string is arranged. It's like seeing two people holding hands in a crowd but not knowing the layout of the whole party.

The Solution: CPLfold
Enter CPLfold, a new computer program designed to solve this puzzle. Here is how it works, using a few simple analogies:

  • The Detective with a Clue: Imagine a detective trying to solve a crime scene (the RNA structure). The detective has the standard rulebook (physics), but they also have a new, powerful clue: a photo showing two specific suspects holding hands (the chimeric evidence). CPLfold is the detective that knows how to use that photo to solve the case much faster and more accurately than before.
  • The Flexible Map: The program is special because it can handle those tricky "knots" (pseudoknots) that other programs can't. It's like a GPS that can navigate a city with overpasses and underpasses, whereas older GPS models get stuck in traffic circles.
  • The Volume Knob: The researchers added two simple "knobs" (parameters) to the software.
    • One knob controls how much weight to give the "photo clue" versus the "rulebook."
    • The other controls how aggressively the program should try to tie those complex knots.
    • This lets scientists tune the program like a radio, finding the perfect balance between trusting the new data and sticking to the known rules.

The Result
CPLfold is fast enough to handle very long strings of RNA without crashing the computer. When tested against other methods, it built more accurate models of the RNA's shape, especially for the long-range connections that are hard to see.

In a Nutshell
CPLfold is a new, super-fast tool that helps scientists figure out the 3D shape of RNA molecules. It does this by combining the laws of physics with direct experimental clues, all while untangling the complex knots that have stumped computers for years. It's like giving a 3D printer a better blueprint so it can build the perfect molecular machine every time.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →