Diversity-Aware Adaptive Collocation for Physics-Informed Neural Networks via Sparse QUBO Optimization and Hybrid Coresets

This paper proposes a diversity-aware adaptive collocation method for Physics-Informed Neural Networks that formulates point selection as a sparse QUBO optimization problem on a kNN graph to efficiently construct hybrid coreset subsets, thereby reducing training redundancy and overhead while improving accuracy on PDEs with shock formation.

Hadi Salloum, Maximilian Mifsud Bonici, Sinan Ibrahim, Pavel Osinenko, Alexei Kornaev

Published 2026-03-10
📖 5 min read🧠 Deep dive

The Big Picture: Teaching a Robot to Predict the Weather

Imagine you are trying to teach a robot (a Physics-Informed Neural Network, or PINN) how to predict the weather. The robot knows the laws of physics (like how wind blows or rain falls), but it needs to "practice" on specific spots on a map to learn how to apply those laws correctly.

These practice spots are called collocation points.

The Problem:
If you just pick random spots to practice on (like throwing darts blindfolded), the robot wastes time practicing on calm, boring days when it already knows the answer. It misses the exciting, dangerous parts like hurricanes or tornadoes (which in math are called shocks or steep gradients).

If you only pick the spots where the robot is currently failing the hardest, you end up with a problem: the robot practices only on the tornadoes and ignores the rest of the country. It becomes an expert at tornadoes but forgets how to predict a gentle breeze. It gets "stuck" in one area and misses the big picture.

The Solution:
This paper proposes a smarter way to pick practice spots. It's like hiring a Talent Scout who doesn't just look for the "best" players, but builds a balanced team.


The Three Key Ideas

1. The "Talent Scout" vs. The "Hype Machine"

  • Old Way (Residual-Based): Imagine a coach who only picks players who are currently making the most mistakes. They keep picking the same player over and over because they keep messing up. The team becomes unbalanced.
  • New Way (Diversity-Aware Coresets): The new method acts like a smart Talent Scout. They want players who are good at fixing mistakes (Informative), but they also want players who are different from each other (Diverse). They don't want ten players who all play the exact same position in the exact same spot. They want a mix that covers the whole field.

2. The "Sparse Graph" (Avoiding the Traffic Jam)

To find this perfect mix, the authors use a mathematical tool called QUBO (which sounds like a robot code, but think of it as a complex puzzle).

  • The Old Puzzle: Imagine trying to solve a puzzle where every single piece is connected to every other piece. If you have 1,000 pieces, that's a million connections. It takes forever to solve and crashes your computer.
  • The New Puzzle (Sparse): The authors realized you don't need to check every connection. You only need to check the pieces that are close to each other (like neighbors in a neighborhood). By only looking at the "neighbors" (using a kNN graph), the puzzle becomes much smaller and faster to solve. It's like asking your immediate neighbors for advice instead of calling the whole city.

3. The "Hybrid Anchors" (The Safety Net)

Even with a smart Talent Scout, there's a risk they might get too obsessed with the "tornado" areas and forget the "breeze" areas entirely.

  • The Fix: The authors introduce Hybrid Anchors. Think of these as mandatory safety posts.
    • They reserve 20% of the practice spots to be spread out evenly across the whole map (like lighthouses on a coast).
    • They use the smart Talent Scout (the QUBO solver) to pick the remaining 80% to focus on the tricky, high-error areas.
    • Result: The robot learns the hard stuff and remembers the basics. It doesn't get lost.

How It Works in Real Life (The Analogy)

Imagine you are painting a giant mural of a stormy sea.

  1. Uniform Sampling: You spray paint dots randomly everywhere. You waste a lot of paint on the calm blue sky and miss the crashing waves.
  2. Residual-Only: You only spray paint where the waves are crashing hardest. You end up with a beautiful, detailed wave, but the rest of the canvas is blank. The painting looks weird and incomplete.
  3. This Paper's Method:
    • Step 1: You put down a few "Anchor" dots evenly across the whole canvas so you have a frame of reference.
    • Step 2: You use a smart algorithm to find the specific spots where the waves are crashing and where the paint is currently messy.
    • Step 3: You make sure you don't put all your new dots right next to each other (that would be redundant). You spread them out to cover different parts of the wave.
    • Step 4: You do this quickly using a "neighbor-only" check instead of checking the whole canvas.

The Results: Faster and Better

The authors tested this on a famous math problem involving fluid dynamics (the Burgers' Equation, which is like a simplified model of traffic jams or shockwaves).

  • Accuracy: Their method made fewer mistakes than the old ways.
  • Speed: Because they used the "Sparse" method (checking neighbors instead of everyone), they solved the puzzle 3 times faster than the heavy, dense method.
  • Efficiency: By using the "Hybrid Anchors," they reached the same level of accuracy in 38% less time than just using random sampling.

The Takeaway

This paper teaches us that when training AI to solve physics problems, quality and variety matter more than just quantity.

Instead of blindly throwing more data at the problem or obsessing only on the hardest parts, we should use a smart, balanced approach that:

  1. Focuses on the hard parts.
  2. Ensures we don't forget the easy parts.
  3. Does it all quickly by ignoring unnecessary connections.

It's the difference between a chaotic mob of people trying to fix a leak and a well-organized construction crew with a plan, the right tools, and a safety net.