UIS-Digger: Towards Comprehensive Research Agent Systems for Real-world Unindexed Information Seeking

This paper identifies the critical limitation of current LLM-based agents in accessing unindexed information, introduces the first dedicated UIS-QA benchmark to quantify this challenge, and proposes UIS-Digger, a multi-agent framework that significantly outperforms state-of-the-art models by effectively combining dual-mode browsing and file parsing to retrieve vital unindexed data.

Chang Liu, Chuqiao Kuang, Tianyi Zhuang, Yuxin Cheng, Huichi Zhou, Xiaoguang Li, Lifeng Shang

Published Tue, 10 Ma
📖 4 min read☕ Coffee break read

Imagine the internet as a massive, sprawling library.

For a long time, AI assistants (the "librarians") have been incredibly good at finding books that are already cataloged on the main shelves. If you ask, "Who won the 2024 World Cup?", they instantly pull the right book off the shelf. This is what the paper calls Indexed Information Seeking (IIS). The search engines have already read these pages, summarized them, and put them in a neat index.

The Problem: The "Hidden" Library

But here's the catch: A huge part of the library is unindexed. These are the books hidden in the back rooms, the files tucked inside drawers, the dynamic displays that change every minute, and the pages that search engine robots (crawlers) simply can't read or aren't allowed to enter.

The paper argues that current AI librarians are terrible at finding these hidden items. If you ask a question that requires digging into a specific PDF on a government website, or clicking through a complex menu to find a date, the AI often gives up, guesses wildly (hallucinates), or says, "I don't know," even though the answer is right there if you just knew how to look.

The authors call this missing skill Unindexed Information Seeking (UIS).

The Solution: UIS-QA (The Test)

To prove how bad the current situation is, the researchers created a new test called UIS-QA.

  • The Analogy: Imagine giving a librarian a list of 110 very specific questions. Some ask for facts hidden inside a company's annual report PDF, others ask for data hidden behind a login-free but complex dropdown menu on a museum's site.
  • The Result: The top-tier AI librarians, who usually score 70% or higher on standard tests, crashed on this new test. They only got about 25% right. It was like a brilliant scholar failing a test because they didn't know how to open a locked filing cabinet.

The Hero: UIS-Digger (The New Librarian)

To fix this, the team built a new AI system called UIS-Digger. Think of it not as a single librarian, but as a specialized excavation team.

  1. The Team Structure: Instead of one brain trying to do everything, UIS-Digger uses a team of four specialized agents:

    • The Planner: The boss who breaks the big question into small steps.
    • The Searcher: The one who uses Google to find the general area.
    • The Surfer: The explorer who actually goes into the website. Crucially, this agent has two modes: it can read text quickly, but if the page has a chart or a weird button, it can switch to "visual mode" (like taking a screenshot) to understand what it's seeing.
    • The Reader: The specialist who opens downloaded files (PDFs, Excel sheets) and reads them line by line.
  2. The Training (SFT & RFT):

    • SFT (Supervised Fine-Tuning): They taught the team by showing them examples of how to solve these hard problems. It's like giving them a map and saying, "Here is how you open that specific drawer."
    • RFT (Rejection Sampling Fine-Tuning): This is the "try, fail, learn" phase. The team tried to solve thousands of problems. If they got the answer right, they kept the method. If they failed or took a silly path, that attempt was thrown away. Over time, the team learned to be much more efficient and less likely to get lost.

The Results

The new team, UIS-Digger, didn't just beat the old librarians; it completely changed the game.

  • While the best existing systems scored around 25% on the hard test, UIS-Digger scored 27.3%.
  • The Twist: They did this using a relatively small, efficient brain (a 30-billion parameter model), beating systems that use massive, expensive super-brains (like O3 or GPT-4.1).

Why This Matters

The paper concludes that we can't just rely on bigger and bigger AI brains to solve these problems. We need to teach them how to interact with the messy, unindexed parts of the internet.

In a nutshell:
Current AI is like a genius who can recite the encyclopedia but can't find the hidden treasure map in the attic. UIS-Digger is a new kind of AI that learns to climb ladders, open locked boxes, and read the fine print in the dark, proving that for real-world research, how you look is just as important as how smart you are.