LLM-Enhanced Topical Trend Detection at Snapchat

This paper presents the first production-scale, end-to-end system for detecting topical trends on Snapchat, which integrates multimodal extraction, time-series burst detection, and LLM-based enrichment to significantly improve content freshness and user experience through global deployment.

Original authors: Hangqi Zhao, Jay Li, Abhiruchi Bhattacharya, Cong Ni, Jason Yeung, Jinchao Ye, Kai Yang, Akshat Malu, Manish Malik

Published 2026-05-01
📖 4 min read☕ Coffee break read

This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine Snapchat as a massive, bustling digital city where millions of people are constantly posting short videos. In this city, "trends" are like sudden, massive street festivals or viral dance crazes that pop up out of nowhere. The challenge for Snapchat is that this city is too big and moves too fast for any human team to watch every corner and spot these festivals as they begin.

This paper describes a new, automated "City Watch" system built by Snapchat engineers to spot these trends the moment they start, using a special kind of artificial intelligence called a Large Language Model (LLM).

Here is how the system works, broken down into four simple steps:

1. The "Eyes and Ears" (Topic Extraction)

First, the system needs to understand what is happening in the videos. Since videos are a mix of pictures, sounds, and text, the system uses a team of AI "detectives."

  • The Visual Detective: Looks at the video frames to see what objects or scenes are there (like a dog, a beach, or a concert).
  • The Audio Detective: Listens to what people are saying (speech-to-text).
  • The Text Detective: Reads any words written on the screen or in the caption.
  • The Summarizer: Once these clues are gathered, a powerful AI (the LLM) acts like a skilled journalist. It takes all that messy information and writes a short, clear headline for the video, like "People dancing to a new song" instead of just a list of random words.

2. The "Surge Detector" (Burst Detection)

Just because a video is about "dogs" doesn't mean it's a trend; people post about dogs every day. The system needs to know when something is suddenly popular.

  • Imagine a quiet street where usually 5 people walk by an hour. Suddenly, 500 people show up in the next hour. That's a "burst."
  • The system tracks how many unique people are posting about a specific topic. It ignores how many people are watching (to avoid bias) and focuses on how many are creating.
  • It uses a mathematical formula to compare today's numbers against the recent past. If the number of creators spikes significantly, the system flags it as a potential trend.

3. The "Quality Control" (Post-Processing)

Not every spike is a good trend. Sometimes a spike is just a glitch, spam, or something too vague like "funny videos."

  • The Filter: The system uses AI rules to throw out bad topics. It removes anything that is too broad (e.g., "life") or anything that violates safety rules (sensitive or unsafe content).
  • The Merger: Sometimes the system spots "World Cup 2026," "World Cup," and "World Cup qualifiers" as three separate things. The AI realizes these are actually the same event and merges them into one single, clean trend called "World Cup 2026." This keeps the list tidy and easy to understand.

4. The "Storyteller" (Trend Enrichment)

Once a trend is confirmed, the system doesn't just give it a name; it builds a profile for it.

  • It picks a few representative videos from the trend and asks a super-smart AI to write a summary, assign a category (like "Sports" or "News"), and list which countries are talking about it most.
  • Think of this as turning a raw data point into a polished news card that the rest of the app can use.

Why Does This Matter? (The Results)

The paper reports that this system has been tested and is now running globally on Snapchat.

  • Accuracy: When humans checked the system's work over six months, it was correct about 92.8% of the time.
  • Real-World Impact: The system is now used to help decide what videos appear on your screen (ranking) and what suggestions appear when you type in the search bar.
  • The Outcome: Because the system spots trends faster, users see fresher, more relevant content. The tests showed that users liked the content more (higher "like" rates) and spent more time watching stories that were part of these new trends.

In short, this paper describes a smart, automated way for Snapchat to listen to the "buzz" of its entire user base, filter out the noise, and instantly tell the app, "Hey, everyone is talking about this right now—let's show it to people!"

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →