Temporal Imbalance of Positive and Negative Supervision in Class-Incremental Learning

This paper identifies temporal imbalance in supervision as a key cause of prediction bias in Class-Incremental Learning and proposes the Temporal-Adjusted Loss (TAL) method, which dynamically reweights negative supervision using a temporal decay kernel to effectively mitigate catastrophic forgetting and improve performance across multiple benchmarks.

Jinge Ma, Fengqing Zhu

Published 2026-03-04
📖 4 min read☕ Coffee break read

Imagine you are a teacher trying to teach a student a new language. You start with Task 1: teaching them 10 words about "Animals." The student learns them well.

Then, you move to Task 2: teaching them 10 words about "Fruits."
Then Task 3: "Vegetables."

In the world of Artificial Intelligence, this is called Class-Incremental Learning (CIL). The problem is that as the teacher introduces new words (Fruits, Vegetables), the student starts to forget the old ones (Animals). This is called Catastrophic Forgetting.

Even worse, the student starts to guess "Fruit" for everything, even when looking at a picture of a dog. They have a bias toward the new things they just learned.

The Old Way of Fixing It

For a long time, researchers thought the problem was simple: "We just have too many pictures of Fruits and Vegetables right now, and not enough of Animals."

So, their solution was like a referee blowing a whistle at the end of the game to tell the student, "Hey, don't guess Fruit so much! Be fair!" They adjusted the final answer sheet (the classifier) to force a balance.

The New Discovery: The "Time" Problem

This paper argues that the old solution is missing the real culprit. It's not just about how many pictures you have; it's about when you saw them.

The authors call this Temporal Imbalance.

Here is the analogy:
Imagine the student is studying for a marathon.

  • Class A (Animals): They studied this 6 months ago. Since then, they haven't seen a single animal picture. Every time they take a practice test, they see a picture of a car, a tree, or a fruit. The teacher keeps saying, "No, that's not an animal!" over and over again. The student gets beaten down by constant "No's" (Negative Supervision).
  • Class B (Fruits): They just studied this yesterday. They see fruit pictures every day. The teacher says, "Yes, that's a fruit!" constantly.

Even if the student has seen the same number of Animal and Fruit pictures in total, the Animal class has been under constant attack by "No's" for months, while the Fruit class has been under constant praise.

By the time the final exam comes, the student is terrified to guess "Animal" because they've been punished for it so many times recently. They only guess "Fruit" because that's what they are currently being reinforced on.

The Solution: Temporal-Adjusted Loss (TAL)

The authors propose a new rule for the teacher called TAL (Temporal-Adjusted Loss).

Think of TAL as a smart memory filter or a volume knob for the teacher's voice.

  1. Tracking the "Freshness": TAL keeps a score for every category (Animals, Fruits, etc.).
    • If a category hasn't been seen in a while, its score drops (it has "low positive supervision").
    • If a category is being seen right now, its score is high.
  2. Turning Down the Volume on "No":
    • When the teacher sees a picture of a Car and says, "This is NOT an Animal," TAL checks the Animal score.
    • If the Animal score is low (because the student hasn't seen animals in a while), TAL says, "Wait, the student is already stressed about Animals. Don't yell 'NO' so loudly!" It turns down the volume of the negative feedback.
    • If the Animal score is high (they just saw an animal), TAL says, "Okay, they are confident. You can yell 'NO' normally if they get it wrong."

Why This Matters

By turning down the "negative volume" for old, forgotten classes, the student doesn't get bullied into forgetting them. They stay confident enough to recognize an old friend (an old class) even when new friends (new classes) are walking around.

The Results

The paper shows that when they use this "smart volume knob" (TAL):

  • The student forgets much less.
  • They get better at recognizing both old and new things.
  • It works like a magic plug-in; you can add it to almost any existing AI system without rebuilding the whole thing.

In a Nutshell

Previous methods tried to fix the AI's bias by adjusting the final answer key. This paper realized the bias was actually caused by the timing of the lessons. Old classes get bullied by constant "No's" because they haven't been seen in a while.

TAL fixes this by whispering "No" gently to the old classes and shouting "No" normally to the new ones, keeping the student's memory balanced over time. It's like giving the old friends a little extra protection so they don't get pushed out by the new kids on the block.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →