GateANN: I/O-Efficient Filtered Vector Search on SSDs

GateANN is an I/O-efficient SSD-based system for filtered vector search that achieves significant performance gains by decoupling graph traversal from vector retrieval, enabling in-memory filtering to bypass unnecessary disk reads for non-matching nodes without requiring index rebuilds.

Original authors: Nakyung Lee, Soobin Cho, Jiwoong Park, Gyuyeong Kim

Published 2026-03-24
📖 5 min read🧠 Deep dive

Original authors: Nakyung Lee, Soobin Cho, Jiwoong Park, Gyuyeong Kim

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine you are a librarian in a massive library with 100 million books. You don't have enough shelf space to keep all the books on the main desk, so most of them are stored in a giant, high-speed warehouse (the SSD) down the hall. You only keep a small, compressed "cheat sheet" for each book on your desk to help you guess which ones are close to what a reader is looking for.

When a reader asks, "Find me the 10 best mystery novels written by female authors published after 2020," you have a problem.

The Old Way: The "Grab Everything" Approach

In the current systems (like DiskANN or PipeANN), here is what happens:

  1. You look at your cheat sheet and pick a few promising books.
  2. You run down to the warehouse, grab the full book (which takes time), and bring it back.
  3. You open the book, read the author's name and the date, and realize, "Oh, this is a sci-fi book by a male author from 1990."
  4. You throw it in the trash and run back to the warehouse to get the next one.

The Problem: If only 10% of the books match the reader's request, you are wasting 90% of your time running to the warehouse, grabbing the wrong books, and throwing them away. You are doing all that running just to find out the books are useless.

The "Naive" Alternative: The "Don't Go" Approach

You might think, "Why not just check the cheat sheet first? If the cheat sheet says 'Male Author,' don't go to the warehouse!"

The Problem: In a graph search, the books are connected by invisible strings. Even if a book doesn't match the criteria, it might be the only bridge connecting you to the next section of the library where the good books are. If you skip it, you might get stuck in a dead end and miss the best books entirely. This is called "breaking the graph."

The New Solution: GateANN (The Smart Gatekeeper)

The authors of this paper built a system called GateANN. It solves the problem by separating the map from the book.

Here is how it works, using our library analogy:

1. The "Map" vs. The "Book"

GateANN realizes that to navigate the library (traverse the graph), you don't need the full book. You only need two things:

  • The Map: A list of which books are connected to the current one (the neighbor list).
  • The Distance Guess: A rough idea of how close the book is to the reader's request (an approximate distance).

Both of these are tiny and can fit on your desk (in memory). You don't need to run to the warehouse to get them.

2. The "Gate" (Pre-Filtering)

Before you even think about running to the warehouse, you check the Map on your desk against the reader's request ("Female Author, Post-2020").

  • If the book matches: Great! You run to the warehouse, grab the full book, and check the exact details.
  • If the book doesn't match: You don't run to the warehouse! Instead, you use the Map on your desk to see where the next connected books are. You "tunnel" through this non-matching book entirely in your head (in memory).

3. "Graph Tunneling"

This is the magic trick. Imagine a non-matching book is a tunnel entrance.

  • Old Way: You walk into the tunnel, realize it's a dead end, and waste time.
  • GateANN Way: You stand at the entrance, look at the map, and instantly see the path leading through the tunnel to the next section. You skip the walk inside the tunnel entirely. You treat the non-matching book as a bridge rather than a destination.

Why This is a Big Deal

  • Speed: Because you stop running to the warehouse for the wrong books, you save massive amounts of time. The paper shows this can make the system 7.6 times faster.
  • No Rebuilding: Some other systems try to solve this by building a special "Female Author Only" library section. But if the rules change (e.g., "Find books by any author"), you have to tear down the whole library and rebuild it. GateANN works with the existing library without changing a single shelf.
  • Memory Cost: The only cost is keeping a slightly larger "Map" on your desk. But since the desk is fast and the warehouse is slow, this trade-off is totally worth it.

The Bottom Line

GateANN is like a librarian who learned to read the spine of the book (the metadata) before walking to the shelf. If the spine doesn't match the request, they don't walk to the shelf at all; they just use a mental map to jump straight to the next promising section. This saves them from millions of wasted trips, making the search incredibly fast without ever losing the best books.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →