Pre-trained LLMs Meet Sequential Recommenders: Efficient User-Centric Knowledge Distillation

This paper proposes an efficient knowledge distillation framework that leverages pre-trained LLMs to generate textual user profiles for sequential recommenders, thereby enhancing user semantic understanding without incurring real-time LLM inference costs or requiring architectural modifications.

Original authors: Nikita Severin, Danil Kartushov, Vladislav Urzhumov, Vladislav Kulikov, Oksana Konovalova, Alexey Grishanov, Anton Klenitskiy, Artem Fatkulin, Alexey Vasilev, Andrey Savchenko, Ilya Makarov

Published 2026-04-24✓ Author reviewed
📖 4 min read☕ Coffee break read

This is an AI-generated explanation of the paper below. It is not written by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine you are a personal shopper for a massive department store. Your job is to guess what a customer will buy next based on what they've bought before.

The Problem: The "Old School" Shopper vs. The "Genius" Shopper

1. The Old School Shopper (Traditional Recommenders):
This is like a shop assistant who only looks at a customer's receipt.

  • Scenario: "You bought shampoo, then conditioner, then a hairdryer. So, you probably want a hairbrush next."
  • The Flaw: They are great at spotting patterns, but they don't understand why. They don't know if the customer loves "eco-friendly" products, hates "cruelty-free" items, or is just buying gifts for a friend. They are efficient and fast, but a bit shallow.

2. The Genius Shopper (Large Language Models - LLMs):
This is like a highly educated consultant who can read the customer's entire diary, social media, and reviews.

  • Scenario: "Ah, I see you bought that organic face cream. You seem to value natural ingredients and have sensitive skin. You also rated that synthetic nail polish poorly because it smelled bad. You're a 'discerning beauty enthusiast' who prioritizes visible results."
  • The Flaw: This genius is incredibly smart, but they are slow and expensive to hire. If you try to ask this consultant to make a recommendation for every single customer the moment they walk into the store, the line will stretch out the door, and the store will go bankrupt paying their hourly rate.

The Solution: The "Mentorship" Program (Knowledge Distillation)

The authors of this paper came up with a clever way to get the best of both worlds. They didn't try to hire the Genius Shopper to work the register. Instead, they set up a Mentorship Program.

Here is how it works, step-by-step:

Step 1: The Interview (Offline Phase)

Before the store opens, the "Genius Shopper" (the LLM) sits down with the store manager. The manager gives the Genius a list of 100,000 customers and their purchase histories.

  • The Genius reads through them and writes a detailed personality profile for each customer (e.g., "User 405 is a budget-conscious tech geek who loves sci-fi movies").
  • Crucial Point: This happens once, before the store opens. It's slow, but it's okay because it's offline.

Step 2: The Training (Distillation Phase)

Now, the "Old School Shopper" (the fast, traditional model) starts training.

  • The manager shows the Old School Shopper a customer's purchase history.
  • The Old School Shopper makes a guess.
  • Then, the manager says, "No, look at the Genius's Profile for this customer. The Genius says this person loves 'organic skincare.' Try to make your internal 'brain' feel the same way about this customer."
  • The Old School Shopper adjusts its internal settings to mimic the understanding of the Genius, without needing the Genius to be there. It learns to "think" like the Genius by studying the profiles the Genius wrote.

Step 3: The Grand Opening (Serving Phase)

The store opens!

  • A customer walks up.
  • The Old School Shopper is now working the register.
  • Because of the training, when the customer buys a face mask, the Old School Shopper instantly thinks, "Ah, this is the 'organic skincare' person! I should recommend that new organic serum!"
  • The Magic: The Old School Shopper is still fast (like a normal computer) and cheap to run, but it now has the wisdom of the Genius Shopper in its head.

Why is this a big deal?

  1. Speed: You don't have to wait for the slow Genius to think during the transaction. The recommendation happens instantly.
  2. Smarts: The recommendations are much better because they understand the person, not just the items.
  3. No Re-invention: You don't have to rebuild the whole store or hire new staff. You just train the existing staff to think a bit deeper.

The Results

The paper tested this on four different types of "stores" (Beauty products, Movies, etc.).

  • Accuracy: The trained "Old School" shoppers became significantly better at guessing what people wanted next (up to 23% better in some cases).
  • Efficiency: They were still lightning-fast, whereas trying to use the "Genius" directly would have been 50 to 180 times slower.

In a nutshell: They taught a fast, simple robot to understand human feelings by having it study the notes written by a slow, super-smart human, so the robot can make smart decisions instantly without needing the human around.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →