IntPro: A Proxy Agent for Context-Aware Intent Understanding via Retrieval-conditioned Inference

The paper proposes IntPro, a proxy agent that enhances context-aware intent understanding by leveraging retrieval-conditioned inference from an individual intent history library, trained via supervised fine-tuning and multi-turn GRPO to effectively adapt to user-specific patterns across diverse scenarios.

Guanming Liu, Meng Wu, Peng Zhang, Yu Zhang, Yubo Shu, Xianliang Huang, Kainan Tu, Ning Gu, Liuxin Zhang, Qianying Wang, Tun Lu

Published 2026-03-05

Imagine you are talking to a very smart, but slightly distant, AI assistant (like a super-charged version of Siri or a chatbot). Sometimes, this AI gets your request wrong because it doesn't "know" you. It sees the words you type, but it misses the vibe, your past habits, or the specific reason you're asking.

The paper introduces IntPro, a solution to this problem. Think of IntPro not as the main AI, but as your personal "Context Coach" or a smart interpreter sitting between you and the big AI.

Here is how it works, broken down with simple analogies:

1. The Problem: The "Amnesiac" AI

Imagine you are at a restaurant. You say, "I'll have the usual."

  • Without IntPro: The waiter (the AI) looks at you blankly. "What is usual? I don't know you. Do you want the steak? The salad? The soup?" It treats every customer as a brand-new stranger.
  • The Issue: Current AIs are great at reading text, but they are bad at remembering who you are and why you usually do things. They miss the "context."

2. The Solution: IntPro (The Personal Coach)

IntPro is a small, specialized agent that sits between you and the big AI. Its job is to figure out what you really mean before passing the message along.

It does this in two clever ways:

A. The "Intent Explanation" (The Translator)

Instead of just guessing your intent (e.g., "He wants food"), IntPro writes a short, human-like note explaining why.

  • Analogy: Instead of just handing the waiter a ticket that says "Order," IntPro writes a note: "This customer is stressed about work and usually orders the spicy soup to comfort themselves. They are likely asking for the spicy soup."
  • This note is stored in a Personal Library for that specific user.

B. The "Smart Memory Check" (The Librarian)

This is the magic part. IntPro doesn't always guess. It knows when to check its memory.

  • The Easy Case: If you say "I want pizza," IntPro knows immediately. It doesn't need to check the library. It just passes the order.
  • The Tricky Case: If you say something vague like "Ugh, do that again," IntPro gets confused. Is it a joke? Are you angry?
    • The Action: IntPro acts like a librarian. It goes to your Personal Library, looks at your past notes, and finds similar situations.
    • The Discovery: It finds a past note: "Last time you said 'do that again,' you were annoyed at your boss."
    • The Result: IntPro now knows you are annoyed, not joking. It updates the note for the big AI: "User is annoyed, likely complaining about a recurring task."
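
The "librarian" step above can be sketched in a few lines of Python. This is only an illustration under loud assumptions: the names IntentNote, IntentLibrary, looks_ambiguous and understand are made up for this example, the word-overlap similarity stands in for whatever retrieval the paper actually uses, and the hard-coded list of vague words stands in for a decision that the real IntPro learns during training.

```python
import re
from dataclasses import dataclass, field


def _tokens(text: str) -> set[str]:
    """Lowercase word set; a crude stand-in for a real text encoder."""
    return set(re.findall(r"[a-z']+", text.lower()))


def _jaccard(a: set[str], b: set[str]) -> float:
    """Word-overlap similarity between two token sets (0.0 to 1.0)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


@dataclass
class IntentNote:
    utterance: str    # what the user said at the time
    explanation: str  # the short note explaining why they said it


@dataclass
class IntentLibrary:
    """Hypothetical per-user store of past intent explanations."""
    notes: list[IntentNote] = field(default_factory=list)

    def add(self, utterance: str, explanation: str) -> None:
        self.notes.append(IntentNote(utterance, explanation))

    def retrieve(self, query: str, top_k: int = 1) -> list[IntentNote]:
        """Return the top_k past notes whose utterance overlaps the query."""
        q = _tokens(query)
        scored = [(_jaccard(q, _tokens(n.utterance)), n) for n in self.notes]
        scored = [(s, n) for s, n in scored if s > 0]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [n for _, n in scored[:top_k]]


# Assumption: a fixed word list decides when to consult the library.
# In the paper this decision is learned, not hard-coded.
VAGUE_WORDS = {"that", "it", "this", "usual", "again"}


def looks_ambiguous(message: str) -> bool:
    return bool(_tokens(message) & VAGUE_WORDS)


def understand(message: str, library: IntentLibrary) -> dict:
    """Easy case: skip the library. Tricky case: fetch similar past notes."""
    if not looks_ambiguous(message):
        return {"retrieved": False, "context": []}
    hits = library.retrieve(message)
    return {"retrieved": True, "context": [n.explanation for n in hits]}


library = IntentLibrary()
library.add("Ugh, do that again", "User was annoyed at their boss.")
library.add("Order me a pizza", "User wanted dinner delivered.")

easy = understand("I want pizza", library)          # no lookup needed
tricky = understand("Ugh, do that again", library)  # lookup finds the old note
```

In this toy version the retrieved explanations would be handed to the model as extra context before it writes the new note for the big AI.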

3. How It Learned to Be Smart (The Training)

You might wonder, "How does this coach know when to check the library and when to just guess?"

The researchers taught IntPro using supervised fine-tuning (learning from worked examples) together with a Reinforcement Learning method called multi-turn GRPO (like training a dog with treats).

  • The Game: They gave IntPro thousands of scenarios.
  • The Reward:
    • If the situation was easy and IntPro guessed right without checking the library, it got a small treat (it saved the time a library check would have cost).
    • If the situation was hard and IntPro guessed wrong without checking, it got a "scolding."
    • If the situation was hard and IntPro checked the library and got it right, it got a big treat.
    • If the situation was easy but IntPro wasted time checking the library, it got a small scolding.

Over time, IntPro learned the perfect balance: "Be fast when you know the answer, but dig deep into the memory when you're unsure."
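
The four treat-and-scolding cases above can be written down as a small reward function. This is a hedged sketch, not the paper's reward: the numeric values, the function names, and the handling of combinations the summary does not mention are all assumptions made for illustration. The second function shows the group-relative scoring that gives GRPO its name, where each attempt is judged against the average of a group of attempts at the same scenario; the paper's multi-turn variant may differ in its details.

```python
from statistics import mean, pstdev

# Assumed, illustrative values; the summary only says "small", "big", etc.
BIG_TREAT = 1.0
SMALL_TREAT = 0.5
SMALL_SCOLDING = -0.2
SCOLDING = -1.0


def reward(is_hard: bool, checked_library: bool, correct: bool) -> float:
    """Score one attempt according to the four cases listed above."""
    if not is_hard and checked_library:
        return SMALL_SCOLDING   # easy case, wasted time on a lookup
    if not is_hard and correct:
        return SMALL_TREAT      # easy case, right answer, no lookup
    if is_hard and checked_library and correct:
        return BIG_TREAT        # hard case, looked it up, got it right
    if is_hard and not checked_library and not correct:
        return SCOLDING         # hard case, guessed blindly and missed
    # Assumption: combinations the summary leaves out are scored
    # on correctness alone.
    return SMALL_TREAT if correct else SCOLDING


def group_relative_advantages(rewards: list[float], eps: float = 1e-8) -> list[float]:
    """Compare each attempt's reward with its group's mean and spread."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Two attempts at the same hard scenario: one checks the library and
# succeeds, the other guesses and fails.
group = [reward(True, True, True), reward(True, False, False)]
advantages = group_relative_advantages(group)
```

With these assumed numbers, the attempt that checked the library ends up with a positive advantage and the blind guess with a negative one, which is the signal that nudges the model toward "dig into the memory when you're unsure."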

4. Why This Matters

  • Privacy: Because IntPro is small and runs on your device (like your phone or laptop), your personal history stays with you. You don't have to send your private thoughts to a giant cloud server.
  • Speed: It's much faster than waiting for a giant server to think, because IntPro is a lightweight "coach" that knows exactly what to look for.
  • Understanding: It stops the AI from being a "one-size-fits-all" robot and makes it feel like it actually knows you.

Summary

IntPro is like a personal translator and memory keeper for your AI. It listens to you, checks your personal history if things are confusing, writes a clear explanation of what you mean, and then tells the big AI exactly how to respond. It makes the AI feel less like a machine and more like a friend who remembers your habits.