DeepInterestGR: Mining Deep Multi-Interest Using Multi-Modal LLMs for Generative Recommendation

DeepInterestGR addresses the limitations of shallow behavioral signals in generative recommendation by leveraging multi-modal LLMs to mine deep, reward-labeled multi-modal interests and integrating them into a two-stage training pipeline that significantly outperforms state-of-the-art baselines on Amazon benchmarks.

Yangchen Zeng

Published 2026-02-24
📖 4 min read☕ Coffee break read

Imagine you are at a massive, endless library where the librarian (the recommendation system) tries to guess your next favorite book.

The Old Way (The "Shallow" Problem):
Currently, most librarians only look at the title and the back cover of the books you've borrowed. If you borrowed a book called "Running 101," they assume you like "running." But they don't know why. Do you run for fitness? For stress relief? To train for a marathon? Or just because you hate sitting still?

Because they only see the surface, their suggestions are often generic. They might suggest another running book, but you actually wanted a book about meditation because running was just your way to clear your mind. This is what the paper calls the "Shallow Interest" problem: the system sees the what, but misses the why.

The New Solution: DeepInterestGR
The authors propose a new system called DeepInterestGR. Instead of just reading the title, this system hires a team of super-smart detectives (Large Language Models, or LLMs) to dig deeper into your history.

Here is how it works, broken down into three simple steps:

1. The Detective Team (Multi-LLM Interest Mining)

Imagine you have a team of four different detectives: one is a logic expert, one is a creative artist, one is a data scientist, and one is a psychologist.

  • The Job: When you interact with an item (like buying "noise-canceling headphones"), these detectives don't just see "headphones." They use their reasoning powers to ask: "Why did they buy this?"
  • The Insight: They might conclude: "This user values silence for deep focus," or "They are a frequent traveler who hates airplane noise."
  • The Teamwork: Since different detectives have different strengths, the system combines their notes. If three of them agree the user likes "focus," that becomes a strong signal. This is called Multi-LLM Interest Mining.

2. The Secret Codebook (Interest-Enhanced Discretization)

Now, the system has a list of your deep interests (e.g., "focus," "travel," "aesthetic beauty"). But computers can't just read a list of words; they need a code.

  • The Translation: The system translates these deep insights into a special Secret Code (called Semantic IDs).
  • The Magic: Instead of just grouping items by their title, it groups them by their soul. A "noise-canceling headphone" and a "quiet study tent" might get similar codes because they both satisfy the deep interest of "focus." This allows the system to recommend things you didn't even know you wanted, as long as they fit your hidden profile.

3. The Coach with a Whistle (Reinforcement Learning with Reward)

Finally, the system needs to learn from its mistakes. In the old days, the system only got a "Good job!" if it guessed the exact item you clicked on.

  • The New Coach: This system has a special coach (the Interest-Aware Reward).
  • How it works: If the system recommends a "quiet study tent" and you click it, the coach gives a huge bonus, even if you didn't buy headphones. Why? Because the recommendation matched your deep interest (focus), not just the surface item.
  • The Result: The system learns to stop guessing random titles and starts guessing based on your true personality and motivations.

Why Does This Matter?

The paper tested this on real-world data (like Amazon reviews for beauty products, sports gear, and musical instruments).

  • The Result: The new system was 10% to 15% better than the best existing systems.
  • The Analogy: It's the difference between a vending machine that only knows you like "Soda" (Surface) versus a personal chef who knows you like "Soda" because you need a quick energy boost after a long run (Deep Interest). The chef can then suggest a protein bar or a new running shoe, and you'll love it.

In short: DeepInterestGR stops treating you like a list of clicks and starts treating you like a human with complex, hidden motivations. It uses AI detectives to find out who you really are, not just what you bought.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →