Imagine you are at a massive, chaotic library (a video search engine like WeChat Channels). You walk up to the librarian and say, "I want to see Guang Liang."
The librarian is confused. Is Guang Liang a famous singer? Is it a brand of liquor? Without knowing who you are, the librarian has to guess. If they guess wrong, you get frustrated, say, "No, I meant the liquor," and ask again. This is the problem the paper solves.
The authors, from Tencent, built a smart system called WeWrite. Think of it as a Super-Intelligent Personal Assistant that stands next to the librarian, whispering the exact right request based on what they know about you.
Here is how they built this assistant, broken down into three simple steps:
1. The "When" Question: Knowing When to Whisper
The Problem: If your assistant whispers a suggestion for every question you ask, it becomes annoying. If you ask, "How do I cook an air fryer?", your assistant shouldn't suddenly suggest "Funny air fryer pranks for couples" just because you watched a comedy last week. That would be a distraction, not a help.
The Solution (Posterior Mining):
The team taught the assistant to look at your past behavior to decide when to speak up.
- The "Frustration Signal": They looked at logs where people asked a question, got bad results (didn't watch the video), and immediately asked a new question.
- The "Aha! Moment": They checked if the new question was related to things the user had watched before.
- The Filter: They used a "Teacher AI" to double-check: "Did this user actually need help because of their history, or did they just make a typo?"
- The Result: The assistant now only whispers when it's 100% sure you need a personalized nudge. If you ask a clear, functional question (like "air fryer recipes"), it stays silent.
2. The "How" Question: Learning to Speak the Library's Language
The Problem: Even if the assistant knows what you want, it might write the request in a weird way that the library's computer system can't understand. Imagine the assistant whispering, "Show me the spicy liquid that makes people happy," when the library only understands the word "Liquor." The library would return zero results.
The Solution (SFT + GRPO):
They trained the assistant in two stages:
- Stage 1 (The Student - SFT): They showed the assistant thousands of examples of "Bad Request → Good Request" pairs. The assistant learned to mimic these corrections, just like a student copying a teacher's handwriting.
- Stage 2 (The Coach - GRPO): This is the clever part. They didn't just let the assistant guess; they gave it a scorecard.
- If the assistant wrote a query that the library system could easily find (high "Index Hit Rate"), it got a gold star.
- If it wrote a fancy, confusing query that the library couldn't find, it got a penalty.
- The assistant practiced thousands of times, learning to write requests that are not only personal to you but also perfectly formatted for the library's database.
3. The "Speed" Question: Doing It Without Waiting
The Problem: Smart assistants usually take a long time to think. In a video app, if you wait 2 seconds for a result, you'll just scroll away. You can't wait for the AI to think before showing you videos.
The Solution (Fake Recall):
They built a parallel highway.
- The Main Road: The traditional search engine starts looking for videos immediately (Text/Vector search).
- The Side Road: At the exact same time, the AI assistant starts thinking about your personalized rewrite.
- The Magic Cache: The assistant doesn't search the whole library. It checks a pre-made "Cheat Sheet" (a Fake Index) that already contains the top results for popular personalized queries.
- The Merge: By the time the Main Road finishes gathering results, the Side Road has already grabbed the personalized ones from the Cheat Sheet. They are merged together instantly. You get the best of both worlds with zero extra waiting time.
The Real-World Result
When they tested this in the real world:
- More Happy Viewers: People watched videos for longer (over 10 seconds) because the results actually matched what they wanted.
- Less Frustration: People stopped having to re-type their search queries because the system understood them the first time.
In a nutshell: WeWrite is a smart, fast, and polite assistant that knows exactly when to help you find what you want, speaks the language of the search engine perfectly, and does it all without making you wait a single second.
Get papers like this in your inbox
Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.