DAXFS: A Lock-Free Shared Filesystem for CXL Disaggregated Memory

This paper presents DaxFS, a novel Linux filesystem that leverages CXL's hardware cache coherence and atomic operations to enable lock-free, decentralized multi-host coordination and high-performance shared memory access without centralized bottlenecks.

Original authors: Cong Wang, Yiwei Yang, Yusheng Zheng

Published 2026-04-03
📖 6 min read🧠 Deep dive

Original authors: Cong Wang, Yiwei Yang, Yusheng Zheng

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

The Big Picture: The "Shared Whiteboard" Problem

Imagine a group of friends (servers) who want to work on a giant, shared whiteboard (memory) in a room.

  • The Old Way (Traditional Filesystems): Usually, if two people want to edit a document, they have to send it back and forth over the mail (network), or they have to wait in line for a single "manager" to tell them when they can write. This is slow and creates traffic jams.
  • The New Hardware (CXL): A new technology called CXL is like a super-fast, magical whiteboard where everyone can see changes instantly, and the hardware itself guarantees that if two people try to write on the same spot at the exact same time, the system sorts it out without them crashing into each other.
  • The Problem: Until now, we didn't have a "rulebook" (filesystem) that knew how to use this magical whiteboard efficiently. Existing rulebooks were too slow, required a manager, or made everyone copy the whiteboard onto their own desks (wasting space).

DaxFS is the new rulebook designed specifically for this magical whiteboard. It lets multiple computers write to the same memory at the same time, instantly, without a manager, and without wasting space.


Key Concepts & Analogies

1. The "No-Manager" Rule (Lock-Free Coordination)

In a normal office, if two people want to edit the same spreadsheet, one has to lock the file, edit it, and unlock it. The other person has to wait.

  • DaxFS Approach: Imagine a group of friends playing a game where they can all write on the whiteboard simultaneously. If two people try to write on the same square, they use a special "magic pen" (called cmpxchg). The pen only works if the square is empty. If it's already taken, the person just tries the next square.
  • The Result: No one ever has to wait in line. There is no "manager" telling people when to write. It's pure, chaotic, but perfectly organized speed.

2. The "Smart Filing Cabinet" (The Hash Overlay)

How do you find a specific file on a whiteboard that has millions of scribbles?

  • Old Way: You might have to walk down every row of the whiteboard looking for the name.
  • DaxFS Approach: DaxFS uses a Hash Overlay. Think of this as a giant, magical index card system.
    • You want to find "File A"? You do a quick math trick to know exactly which drawer it's in.
    • If the drawer is empty, you drop your file in.
    • If the drawer is full, you try the next one immediately.
    • Because everyone uses the same magic math and the same "magic pen," multiple people can drop files into different drawers at the exact same time without bumping into each other.

3. The "Shared Library" (Cooperative Caching)

Usually, if Computer A reads a file, it keeps a copy in its own memory. If Computer B needs the same file, it makes a second copy. This wastes memory.

  • DaxFS Approach: Imagine a single, shared library in the middle of the room. Everyone reads from the same book.
  • The "MH-Clock" Algorithm: What happens if the library gets too full? Who throws a book out?
    • In normal systems, a librarian (a central daemon) decides what to throw away.
    • In DaxFS, everyone is their own librarian. They use a clever "Clock" system. Each computer looks at the books, checks if anyone is currently reading them (using a "refcount" tag), and if a book hasn't been touched in a while, they gently push it out to make room for a new one. They do this without asking permission, ensuring no one throws out a book someone is currently reading.

4. The "Zero-Copy" Superhighway

Normally, when a program wants data, the computer copies it from the hard drive -> to the system memory -> to the program. It's like moving a box from a truck to a warehouse, then to your house.

  • DaxFS Approach: DaxFS builds a direct tunnel. The program points its finger directly at the memory on the whiteboard. No moving boxes. No copying. It's "Zero-Copy."
  • Why it matters: This is crucial for AI. If you are loading a massive AI brain (a Large Language Model) into a GPU, DaxFS lets the GPU grab the data directly from the shared memory without the CPU getting in the way.

5. The "GPU" Connection

GPUs (the chips that power AI) are usually very fast but isolated. They usually have to ask the CPU to get data for them.

  • DaxFS Innovation: Because DaxFS is built on "magic pens" (atomic operations) that work over the same high-speed wires (PCIe) that connect GPUs, the GPU can actually participate in the filing system!
  • The Analogy: Imagine the GPU is a super-fast robot. Instead of waiting for a human (CPU) to fetch a book from the library, the robot can run up, check the index, and grab the book itself. The paper shows this works incredibly fast, limited only by the speed of the wires, not the software.

The Results: How Fast Is It?

The researchers tested DaxFS against the current gold standard for in-memory files (called tmpfs).

  1. Writing Speed: When multiple computers try to write random data at the same time, DaxFS was 2.68 times faster than the competition. It's like a highway where everyone can merge instantly without traffic lights.
  2. Reading Speed: It was also faster at reading, especially when many people are reading at once.
  3. Accuracy: Even when two computers tried to write to the exact same spot at the exact same time, DaxFS got it right 99%+ of the time with no lost data.

Why Should You Care?

  • For AI: Training AI models requires massive amounts of memory. DaxFS allows multiple computers to share that memory pool without duplicating data, saving money and speeding up training.
  • For Containers: If you run 100 containers that all need the same operating system files, DaxFS lets them all share one single copy in memory instead of 100 copies.
  • The Future: As CXL hardware becomes common in data centers, DaxFS is the software layer ready to unlock its full potential, turning a "shared memory" dream into a fast, practical reality.

In a nutshell: DaxFS is the traffic cop that doesn't stop traffic. It uses the hardware's superpowers to let thousands of computers write and read from a shared memory pool simultaneously, safely, and incredibly fast.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →