HiLiftAeroML: High-Fidelity Computational Fluid Dynamics Dataset for High-Lift Aircraft Aerodynamics

This paper introduces HiLiftAeroML, the first open-source high-fidelity CFD dataset featuring 1,800 GPU-accelerated LES simulations of NASA's CRM high-lift geometry, designed to accelerate the development of AI surrogate models for aerospace applications.

Original authors: Neil Ashton, Adam Clark, Liam Heidt, Christopher Ivey, Sanjeeb Bose, Rahul Agrawal, Konrad Goc, Rishi Ranade, Corey Adams, Peter Sharpe, Sheel Nidhan, Semit Akkurt, Daniel Leibovici, Jean Kossaifi

Published 2026-05-20
📖 5 min read🧠 Deep dive

Original authors: Neil Ashton, Adam Clark, Liam Heidt, Christopher Ivey, Sanjeeb Bose, Rahul Agrawal, Konrad Goc, Rishi Ranade, Corey Adams, Peter Sharpe, Sheel Nidhan, Semit Akkurt, Daniel Leibovici, Jean Kossaifi

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine you are trying to teach a robot how to fly a complex airplane. To do this, you need to show it thousands of examples of how the air moves around the wings, especially when the plane is taking off or landing. These moments are tricky because the air gets messy, swirling, and separating from the wings in chaotic ways.

For a long time, scientists have used two main ways to study this:

  1. Wind Tunnels: Building physical models and blowing real air at them. This is accurate but incredibly expensive and slow.
  2. Computer Simulations (CFD): Using math to predict the air. The standard method is fast but often gets the messy parts wrong, like a blurry photo. A better method exists that takes high-definition photos of the air, but it usually takes supercomputers weeks to generate just one picture.

The Problem: To train a smart AI (a "surrogate model") to predict these messy airflows instantly, you need a massive library of these high-definition pictures. But until now, that library didn't exist for complex airplanes.

The Solution: HiLiftAeroML
This paper introduces HiLiftAeroML, a massive, free, open-source library of 1,800 high-definition "snapshots" of air flowing around a specific type of airplane (the NASA Common Research Model).

Here is how they built it, using some simple analogies:

1. The Airplane: A Shape-Shifting Lego Set

The researchers didn't just use one airplane. They used a digital version of the NASA "Common Research Model" (CRM), which is like a standard Lego airplane used by scientists worldwide.

  • The Twist: They made the Lego pieces move. They created 180 different versions of this airplane by changing the angles of the flaps and slats (the little wings on the front and back that pop out during takeoff and landing).
  • The Weather: For each of those 180 shapes, they simulated the air hitting the plane at 10 different angles (from a gentle approach to a steep climb).
  • The Result: 1,800 unique scenarios (180 shapes × 10 angles).

2. The Camera: A Super-Sharp Lens

Most computer simulations use a "blurry" lens (called RANS) that averages out the chaos. It's like watching a sports game through a foggy window; you see the players moving, but you miss the individual spins and collisions.

For this dataset, the authors used a Wall-Modeled Large Eddy Simulation (WMLES).

  • The Analogy: Think of this as a 4K, slow-motion camera that captures every single swirl and eddy of the air.
  • The Cost: This "camera" is so powerful that it requires a grid of 300 to 500 million tiny cells (pixels) just to cover the airplane. To put that in perspective, a standard simulation might use 10 million cells. This is like upgrading from a standard-definition TV to a massive, ultra-high-definition screen.
  • The Hardware: They ran these simulations on NVIDIA GPUs (the same powerful chips used for gaming and AI), which acted like a fleet of super-fast cameras snapping these pictures.

3. The Library: Free for Everyone

The authors didn't keep these 1,800 high-definition snapshots to themselves. They put the entire library on the internet (HuggingFace) for anyone to download for free.

  • What's inside: You get the 3D shape of the airplane, the "blurry" average forces (lift and drag), and the detailed "high-def" data of the air pressure and speed inside and around the plane.
  • The Goal: They want AI researchers to use this library to train their own "flight robots." Once an AI learns from these 1,800 perfect examples, it should be able to predict how the air behaves on new airplane designs in a split second, without needing to run the expensive, slow simulation again.

4. Did it Work? (The Quality Check)

Before releasing the library, the authors checked their work against real-world wind tunnel experiments.

  • The Test: They compared their computer "photos" of a specific landing configuration against actual photos taken in a wind tunnel.
  • The Result: Their high-definition simulation matched the real-world data very well, especially for the tricky parts like "drag" (air resistance) and "pitching moment" (how the nose wants to tilt). This proves their "camera" was sharp enough to capture the real physics.

Summary

In short, the authors built the first-ever "high-definition" library of airplane aerodynamics for takeoff and landing scenarios. They used the most advanced, expensive, and accurate computer methods available to generate 1,800 examples. By making this data free, they hope to help engineers and AI developers build smarter, faster tools to design safer and more efficient airplanes in the future.

What the paper does NOT claim:

  • It does not claim that AI has already replaced wind tunnels (it's a tool to help, not a replacement yet).
  • It does not claim to have solved the physics of every possible airplane (it focuses on this specific NASA model).
  • It does not claim to have simulated full-scale flight conditions (the data is based on wind-tunnel scale conditions).

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →