WebExpert: domain-aware web agents with critic-guided expert experience for high-precision search

WebExpert is a domain-aware web agent that enhances high-precision search in specialized fields like finance and biomedicine by integrating sentence-level experience retrieval, weakly supervised facet induction, and preference-optimized planning to significantly improve answer accuracy and reduce navigation steps compared to existing baselines.

Yuelin Hu, Zhengxue Cheng, Ronghua Wu, Qunshan Gu, Hongwei Hu, Wei Liu, Qiao Liang, Li Song

Published 2026-04-09
📖 4 min read☕ Coffee break read

Imagine you are trying to solve a very tricky puzzle, like figuring out the best way to invest your money or understanding a complex medical diagnosis. You decide to ask a very smart, fast robot to search the internet for you.

The Problem with the "Generic" Robot
Most current web-search robots are like generalist tourists. They are fast and can read a lot, but they don't know the local customs.

  • If you ask, "What's the best time to buy stocks?", a tourist might just search for "best time to buy stocks."
  • They might miss crucial details like which country you are in, what the current laws are, or what season it is.
  • As a result, they wander around the internet, visit too many irrelevant websites, and often give you an answer that sounds good but is actually wrong or outdated. They waste time and get confused by "noise."

The Solution: Meet "WebExpert"
The researchers behind this paper built a new kind of robot called WebExpert. Think of WebExpert not as a tourist, but as a seasoned local guide who has read thousands of expert manuals and learned from the mistakes of others before you.

Here is how WebExpert works, broken down into three simple steps using analogies:

1. The "Experience Library" (Critic-Guided Extraction)

Before WebExpert even starts searching, it opens a special library of "Expert Notes."

  • How it works: The team took thousands of real-world questions and answers (like "How does asset correlation affect diversification?") and asked a super-smart AI to read them.
  • The Magic: Instead of just saving the whole answer, the AI acts like a critic editor. It cuts out the fluff and extracts the core rule.
    • Example: Instead of saving a 5-page article, it saves a sticky note that says: "Diversification works best when assets are uncorrelated. Always check the time period and region."
  • The Result: WebExpert has a mental cheat sheet of "rules of thumb" for specific topics like finance, medicine, or law.

2. The "Smart Search Plan" (Schema-Light Facet Induction)

When you ask a question, WebExpert doesn't just type it into Google. It first checks its Expert Notes to see what specific details (facets) matter.

  • The Analogy: Imagine you are ordering a pizza. A generic robot just orders "Pizza." WebExpert asks itself: "Wait, the expert notes say for 'Finance' questions, I need to know the Region, the Time, and the Policy."
  • The Action: It automatically builds a better search query. Instead of "best stocks," it searches for "best stocks for US investors in 2024 under current SEC regulations."
  • The Safety Net: If the robot isn't sure which expert note to use, it has a "fallback" mode. It doesn't guess; it switches to a safe, general search so it doesn't get stuck in a loop.

3. The "Deep Dive" (Preference-Optimized Planning)

Once it has a great search plan, WebExpert goes out and explores the web.

  • The Difference: Because it started with a precise plan, it doesn't need to click through 10 different websites to find the answer. It goes straight to the right page.
  • The Training: The robot was trained using a "reward system." If it finds the right evidence quickly, it gets a "gold star." If it wanders off-topic, it gets a "red card." Over time, it learned to be extremely efficient.

Why Does This Matter? (The Results)

The researchers tested WebExpert on hard puzzles involving finance, science, and general knowledge.

  • Generic Robots: Often got the answer wrong or took too many steps (like walking 8 miles to find a cup of coffee).
  • WebExpert: Got the answer right 1.5% to 3.6% more often (which is huge in AI!) and took fewer steps (only 5.2 steps instead of 8.1).

The Big Picture

Think of WebExpert as upgrading from a flashlight to a GPS with a local guide.

  • The flashlight (generic AI) just shines light everywhere, hoping to see something useful.
  • The GPS (WebExpert) knows the terrain, knows the traffic rules, and knows exactly which turn to take to get you to your destination without getting lost.

This paper shows that by giving AI access to "expert experience" and teaching it to ask better questions before it starts searching, we can make it much smarter, faster, and more reliable in specialized fields like medicine and finance.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →