A Comparison of Massively Parallel Performance Portable Particle-in-Cell schemes for electrostatic kinetic plasma simulations

This paper evaluates the performance and portability of various Poisson solvers, including FFT, PCG, FEM, and the novel Particle-in-Fourier (PIF) schemes, within the IPPL library for electrostatic PIC simulations on diverse GPU architectures, finding that while FFT is fastest, the PIF scheme offers excellent scalability as a high-fidelity alternative.

Original authors: Sonali Mayani, Paul Fischill, Sriramkrishnan Muralikrishnan, Andreas Adelmann

Published 2026-05-08
📖 5 min read🧠 Deep dive

Original authors: Sonali Mayani, Paul Fischill, Sriramkrishnan Muralikrishnan, Andreas Adelmann

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine you are trying to simulate a massive crowd of people (particles) moving through a room, where everyone is pushing and pulling on each other based on their position. This is essentially what scientists do when they simulate plasma (a super-hot, electrically charged gas) to understand how it behaves.

This paper is a "race report" comparing different ways to calculate the forces between these particles to see which method is the fastest and most reliable on the world's most powerful supercomputers.

Here is the breakdown of the race using simple analogies:

The Setting: The "Particle-in-Cell" Loop

Think of the simulation as a game played in rounds. In every round, the computer does four things:

  1. Scatter: It takes the positions of the particles and paints their "charge" onto a grid (like a chessboard).
  2. Solve: It calculates the electric field (the push/pull force) everywhere on that grid based on the painted charges. This is the main event of the race.
  3. Gather: It reads the force from the grid and tells each particle how to move.
  4. Push: The particles move to their new spots.

The authors tested four different "Solvers" (methods to calculate step 2) to see which one wins.

The Four Racers

1. The FFT Solver (The Fast Sprinter)

  • How it works: This method uses a mathematical trick called the "Fast Fourier Transform." Imagine trying to solve a puzzle by instantly seeing the whole picture in a mirror rather than looking at one piece at a time. It's incredibly fast.
  • The Catch: It only works if the room has "periodic" boundaries. Think of it like a video game world where if you walk off the right edge, you instantly appear on the left. It can't handle walls or open doors.
  • The Result: It was the absolute fastest in terms of raw time. However, on one specific supercomputer (Alps), it stumbled because the "particle movement" part of the loop got stuck, slowing down the whole race.

2. The PCG Solver (The Reliable Workhorse)

  • How it works: This method breaks the grid down into tiny squares and solves the math step-by-step, like a detective checking every clue one by one. It uses a "Preconditioned Conjugate Gradient" approach.
  • The Catch: It is much slower than the FFT (about 10 times slower in raw time), but it is very flexible. It can handle walls (Dirichlet) or open spaces (Neumann), not just the "wrap-around" video game world.
  • The Result: It scales well (gets faster as you add more computers), but it takes longer to finish the job.

3. The FEM Solver (The High-Accuracy Architect)

  • How it works: This is the "Finite Element Method." Instead of a rigid grid, it treats the space like a flexible mesh that can bend and fit complex shapes. It's like using a tailor-made suit instead of a boxy, off-the-rack shirt.
  • The Catch: Like the PCG, it is slower than the FFT. It also struggles a bit with communication between computers because it has to constantly check the edges of its flexible mesh.
  • The Result: It's great if you need high precision or complex shapes, but it's not the speed champion.

4. The PIF Solver (The New Contender)

  • How it works: This is the "Particle-in-Fourier" scheme. Instead of painting particles onto a grid first, it projects them directly into "frequency space" (a mathematical representation of waves). It's like skipping the map entirely and navigating by the rhythm of the waves.
  • The Catch: It requires special math (Non-Uniform FFTs) to handle particles that aren't perfectly aligned.
  • The Result: It is more expensive (slower) than the FFT, but it is incredibly stable and accurate. It doesn't suffer from the "ghosting" or "aliasing" errors that happen when you try to fit a round particle onto a square grid. It scales beautifully on all machines, meaning it gets faster very efficiently as you add more power.

The Race Track (The Supercomputers)

The authors ran these tests on three different "tracks" (supercomputers) with different engines:

  • Alps (Switzerland): Uses Nvidia's newest chips.
  • LUMI (Finland): Uses AMD chips.
  • JUWELS Booster (Germany): Uses older Nvidia chips.

The Winner's Podium

  • Raw Speed: The FFT Solver wins hands down, but only if your problem fits its strict rules (periodic boundaries) and you aren't using the specific Alps machine where a technical glitch slowed it down.
  • Flexibility: The PCG and FEM solvers are the best choice if your simulation has walls or complex shapes. They are slower but get the job done where the FFT can't go.
  • High Fidelity: The PIF solver is the new star. While it takes a bit longer than the FFT, it offers the best balance of speed, stability, and accuracy. It's like a sports car that is slightly slower than a Formula 1 car but handles the turns much better and is safer to drive.

The Bottom Line

The paper concludes that there is no single "best" solver.

  • If you need speed and have simple boundaries, use FFT.
  • If you need flexibility (walls, complex shapes), use PCG or FEM.
  • If you need high accuracy and stability without the errors of standard methods, PIF is an excellent, scalable alternative.

The authors also noted that they are currently working on fixing the "particle update" glitch on the Alps supercomputer and improving the "preconditioning" (a way to speed up the math) for the FEM solver to make them even faster in the future.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →