SOLAR: SVD-Optimized Lifelong Attention for Recommendation

The paper introduces SOLAR, a recommendation framework that employs SVD-Optimized Attention to achieve theoretically lossless, low-rank sequence modeling with reduced computational complexity, enabling efficient processing of ultra-long user behavior sequences and delivering significant performance gains in Kuaishou's online recommendation system.

Chenghao Zhang, Chao Feng, Yuanhao Pu, Xunyong Yang, Wenhui Yu, Xiang Li, Yongqi Liu, Lantao Hu, Kaiqiao Zhan, Han Li, Kun Gai

Published 2026-03-04
📖 4 min read☕ Coffee break read

Imagine you are a Talent Scout for a massive music festival. Every day, you have to choose the best 3,000 bands to play from a library of millions of songs. But here's the catch: you also have to remember the listening history of every single fan in the crowd. Some fans have listened to 10,000 songs in the past.

Your job is to match the right band to the right fan, right now.

The Problem: The "Memory Overload"

In the past, the standard way to do this (called Standard Attention) was like a super-organized but incredibly slow librarian.

  • To pick a band for a fan, the librarian had to compare that fan's entire 10,000-song history against every single one of the 3,000 bands currently on stage.
  • The Math: This creates a massive grid of comparisons. If you have 10,000 history items and 3,000 candidates, you are doing 30 million comparisons per fan.
  • The Result: It's so slow and expensive that companies usually just say, "Forget the last 9,900 songs. Let's only look at the last 50." This is like judging a fan's taste based only on what they listened to this morning, ignoring their whole life's musical journey.

The Failed Shortcut: The "Blurry Lens"

Scientists tried to speed this up with Linear Attention.

  • The Analogy: Imagine instead of comparing every song to every band, you just take a "blurry average" of the fan's history.
  • The Problem: This is fast, but it's too blurry. It loses the specific details. It's like trying to pick a specific song for a fan by only knowing they "like music." You miss the nuance. It changes the rules of the game, making the recommendations feel flat and less accurate.

The Solution: SOLAR (The "Smart Summarizer")

The authors of this paper, working with the app Kuaishou (a massive video platform), realized something brilliant: Fan listening habits aren't random chaos; they have a pattern.

Even though a fan has 10,000 songs, they probably only really care about 3 or 4 main "vibes" (e.g., "Sad Ballads," "Upbeat Pop," "Old School Hip Hop"). The other 9,990 songs are just variations of those few core vibes.

They invented SOLAR, which uses a mathematical trick called SVD (Singular Value Decomposition).

How SOLAR Works (The Creative Analogy)

Imagine the fan's 10,000-song history is a giant, messy pile of 10,000 different colored marbles.

  1. The Old Way: You try to match every single marble to every single band. (Too slow).
  2. The Linear Way: You dump the marbles into a blender and make a smoothie. (Too vague).
  3. The SOLAR Way: You realize that 99% of those marbles are just shades of Red, Blue, and Green.
    • SOLAR instantly compresses the 10,000 marbles down to just 3 representative colors (Red, Blue, Green).
    • It then matches the bands to these 3 colors.
    • The Magic: Because the math proves that the "Red, Blue, Green" summary contains all the important information, you haven't lost any detail. You've just removed the redundancy.

Why This is a Big Deal

  1. It Keeps the "Softmax" (The Fairness): Previous fast methods threw away the "Softmax" mechanism, which is the mathematical rule that ensures the best options stand out. SOLAR keeps this rule intact, so the recommendations remain sharp and competitive.
  2. It's "Lifelong": Because it's so fast, the system can now look at a fan's entire 10,000-song history without cutting anything off. It understands the user's whole life, not just their last 50 clicks.
  3. It Handles the Crowd: It can also look at the 3,000 bands on stage all at once, understanding that if a fan likes "Sad Ballads," they might prefer Band A over Band B because Band C is also playing (context matters).

The Real-World Result

When Kuaishou put SOLAR to work:

  • Speed: It became much faster, saving money on computer servers.
  • Accuracy: It started recommending videos that people actually wanted to watch.
  • Impact: They saw a 0.68% increase in video views. In the world of billions of users, that tiny percentage translates to millions of extra views and a happier audience.

Summary

SOLAR is like hiring a genius assistant who can read a fan's entire 10,000-item history in a split second, summarize it into its core "vibes," and instantly pick the perfect video from a list of thousands, all while keeping the recommendations fair and accurate. It solves the "too much data" problem by realizing that most of the data is just repetition, and we only need to look at the unique patterns.