CA-HFP: Curvature-Aware Heterogeneous Federated Pruning with Model Reconstruction

CA-HFP is a novel federated learning framework that enables personalized, curvature-aware structured pruning on heterogeneous edge devices with model reconstruction, offering a theoretical convergence guarantee and demonstrating superior efficiency and accuracy compared to existing baselines across diverse datasets and architectures.

Gang Hu, Yinglei Teng, Pengfei Wu, Shijun Ma

Published 2026-03-16
📖 4 min read☕ Coffee break read

Imagine a massive group project where hundreds of students (devices) are trying to write a single, perfect encyclopedia together. However, there's a catch: they can't share their private notes (data) with each other, and they all have very different resources. Some students have supercomputers and fast internet; others have old calculators and dial-up connections. Some are experts in history, while others only know about cooking.

This is the world of Federated Learning. The goal is to build one great encyclopedia without anyone ever seeing everyone else's private notes.

The problem? If everyone tries to send their whole encyclopedia draft back and forth, the internet gets clogged, and the students with old calculators crash. If they just send small, random chunks, the final book ends up messy and full of contradictions.

Enter CA-HFP (Curvature-Aware Heterogeneous Federated Pruning). Think of it as a smart, adaptive project manager that solves these problems using three clever tricks.

1. The "Customized Backpack" (Personalized Pruning)

In a normal project, everyone is asked to carry the same heavy backpack full of every single fact. But in CA-HFP, the manager looks at each student's backpack capacity.

  • Student A (a powerful phone) gets a backpack with 80% of the facts.
  • Student B (a tiny sensor) gets a backpack with only 40% of the facts.

But here's the genius part: How do they decide which facts to keep?
Old methods just said, "Keep the biggest facts" or "Keep the facts you used most recently." CA-HFP uses a "Curvature-Aware" compass. Imagine the facts are on a hilly landscape.

  • Some facts are on a flat plain (not very important).
  • Some are on a steep cliff (very important, changing them ruins the whole map).
  • CA-HFP calculates the "steepness" (curvature) of the hill for every fact. It tells the students: "Keep the facts that are on the steep cliffs because if you drop those, the whole map collapses. You can safely throw away the ones on the flat plains."

This ensures that even though students carry different amounts of information, they are all carrying the most critical pieces of the puzzle.

2. The "Magic Translator" (Model Reconstruction)

Now, imagine the students finish their work and send their backpacks back to the manager to combine them.

  • Student A sent a backpack with 80% of the facts.
  • Student B sent a backpack with 40% of the facts.

If the manager tries to mix them directly, it's like trying to glue together two puzzle pieces that are different shapes. It won't fit! This is the "structural mismatch" problem.

CA-HFP introduces a Magic Translator (the Reconstruction step). Before mixing the backpacks, the manager takes Student B's sparse backpack and "fills in the blanks" using the current master copy of the encyclopedia.

  • If Student B didn't send a fact about "Ancient Rome," the manager looks at the master copy, sees what the current best guess for "Ancient Rome" is, and temporarily fills that spot in Student B's backpack.
  • Now, every backpack looks like it has the same shape and size, even though the content inside is still unique to that student.

This allows the manager to mix them all together perfectly without the pieces clashing.

3. The "Fairness Guarantee" (Convergence)

The paper also proves mathematically that this system won't go crazy. Because the students have different data (some know cooking, some know history) and different backpack sizes, the final encyclopedia could become biased or unstable.

CA-HFP calculates a "safety margin." It knows exactly how much the "steepness" of the hills (curvature) and the "missing facts" (pruning) will shake the table. By adjusting how much each student works and how they mix their notes, it guarantees that the group will eventually agree on a stable, high-quality encyclopedia, even if the students are very different.

The Result?

In the real world, this means:

  • Faster Internet: Students send much less data because they only send their "backpacks" (sparse models), not the whole encyclopedia.
  • Less Battery Drain: Students with weak phones don't have to do heavy lifting; they only process the specific facts they are allowed to keep.
  • Smarter Results: Despite the differences, the final model is just as accurate (or better) than if everyone had sent everything.

In short: CA-HFP is like a smart team leader who gives everyone a custom-sized backpack, tells them exactly which items are too heavy to drop, and uses a magic trick to make sure everyone's different backpacks can be mixed together perfectly to build one amazing result.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →