Proxics: an efficient programming model for far memory accelerators

This paper proposes "Proxics," an efficient programming model for Near-Data Processing accelerators that adapts familiar OS abstractions like virtual processors and IPC channels into lightweight mechanisms via compilation and interconnect protocols, demonstrating significant performance gains and highlighting the critical need for low-latency CPU-to-accelerator communication.

Original authors: Zikai Liu, Niels Pressel, Jasmin Schult, Roman Meier, Pengcheng Xu, Timothy Roscoe

Published 2026-04-21
📖 5 min read🧠 Deep dive

Original authors: Zikai Liu, Niels Pressel, Jasmin Schult, Roman Meier, Pengcheng Xu, Timothy Roscoe

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine you are running a massive library (a data center) where the books (data) are stored in a giant, remote warehouse (Far Memory) far away from the main reading room (the CPU).

The Problem:
Usually, when a librarian (the CPU) needs a book, they have to walk all the way to the warehouse, grab it, and bring it back. If the warehouse is huge and the librarian is busy, this walk takes a long time, and the librarian spends more time walking than reading.

To fix this, engineers built Near-Data Processing (NDP) accelerators. Think of these as small, temporary reading desks set up right inside the warehouse. Instead of the main librarian walking back and forth, a small assistant sits at the desk, grabs the books, does the work (like summarizing or sorting), and only sends the final result back to the main room.

The Challenge:
The problem is that these "warehouse assistants" are hard to talk to. Currently, you have to give them very specific, low-level instructions (like "move byte 402 to address 99") using a special, confusing language. It's like trying to hire a new assistant but only being able to communicate via Morse code. It's efficient for the machine, but a nightmare for humans.

The Solution: Proxics
The authors of this paper created Proxics, a new way to talk to these assistants. They decided to use the same "language" we already use for our computers: Processes and Pipes.

Here is how Proxics works, using simple analogies:

1. The "Spawn" (Hiring an Assistant)

Instead of the complex "forking" (copying the whole boss) used in normal computers, Proxics uses a spawn() command.

  • Analogy: Imagine you don't clone yourself to go to the warehouse. Instead, you just hand a specific, pre-written instruction manual (a small program) to a new assistant sitting at the warehouse desk. You say, "Here are the rules, go do this specific task." It's lightweight and fast.

2. The "Pipes" (The Messenger Tubes)

This is the most important part. In normal computers, moving data between the main room and the warehouse is slow and clunky. Proxics introduces Pipes.

  • Analogy: Imagine a network of pneumatic tubes (like in old banks or hospitals) connecting the main reading room directly to the warehouse desks.
    • Old Way: You write a note, walk to the warehouse, hand it to the assistant, wait, walk back, get the result.
    • Proxics Way: You drop a note into a tube. The assistant grabs it instantly, does the work, and shoots the result back through another tube.
    • Why it matters: The paper shows that if these tubes are slow, the whole system fails. Proxics makes these tubes incredibly fast, allowing the main computer and the warehouse assistants to chat constantly without getting tired.

3. The "Split-Phase" (Multitasking)

The assistants in the warehouse don't have super-fast brains (they are slower than the main CPU), but they are right next to the books.

  • Analogy: When an assistant needs to fetch a book, they don't stand there staring at the shelf waiting for it. They shout "I need Book A!" and immediately start working on "Book B" while the conveyor belt brings Book A. By the time they are ready for Book A, it's already there. Proxics teaches the assistants to do this "shout and work" trick automatically.

Why is this a Big Deal?

The researchers built a real prototype using actual hardware (FPGAs) to prove this works. They tested it on three types of tasks:

  1. Bulk Operations: Like copying a million books. The assistant does it much faster because they don't have to walk back to the main room.
  2. Databases: Imagine a librarian trying to find all books with "Dragon" in the title. The assistant can scan the shelves right there and only send the matching titles back, saving a ton of walking time.
  3. Graphs (Social Networks): Imagine tracing connections between people. This is messy and unpredictable. Proxics allows multiple assistants to talk to each other through the tubes to solve the puzzle together, which is something older systems couldn't do well.

The "Gotcha" (The Catch)

The paper also admits a tricky part: Cache Coherence.

  • Analogy: Imagine the main librarian and the warehouse assistant both have a copy of the same book. If the librarian updates a page in their copy but forgets to tell the assistant, the assistant might read the old page.
  • Because the "warehouse" (Far Memory) doesn't automatically sync with the "main room" (CPU) in current technology, the programmer has to be very careful to tell the system when to "flush" (update) the information. The paper even tested if an AI (LLM) could write code for this and found that even AI struggles with these low-level details!

The Bottom Line

Proxics is a bridge. It takes the powerful, efficient idea of "computing near the data" and wraps it in a familiar, easy-to-use package (Processes and Pipes). It proves that if we build the right "tubes" (communication channels) between our main computers and these new warehouse assistants, we can make data centers faster, cheaper, and much more efficient without needing to be a hardware genius to use them.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →