AMR-CCR: Anchored Modular Retrieval for Continual Chinese Character Recognition

This paper proposes AMR-CCR, an anchored modular retrieval framework with script-conditioned injection and multi-prototype dictionaries to address the challenges of continual, class-incremental ancient Chinese character recognition, accompanied by the new EvoCON benchmark for systematic evaluation.

Yuchuan Wu, Yinglian Zhu, Haiyang Yu, Ke Niu, Bin Li, Xiangyang Xue

Published 2026-03-10
📖 5 min read🧠 Deep dive

Imagine you are the head librarian of a massive, ancient library dedicated to Chinese calligraphy. Your job is to identify every character in the books, rubbings, and stone tablets that keep arriving from archaeological digs.

Here is the problem: The library never stops growing.

Every week, new boxes arrive with characters written in different styles (scripts) from different eras. Some look like tiny drawings (Oracle Bone), others look like elegant cursive (Seal Script). The characters are incredibly similar to each other—like twins wearing slightly different hats—and the handwriting varies wildly depending on who wrote them.

The Old Way: The "Hard-Drive" Librarian

Traditionally, librarians tried to solve this by training a single "super-brain" (a neural network) to memorize every character they've ever seen.

  • The Flaw: Every time a new box of characters arrives, they have to retrain the whole brain.
  • The Result: The brain gets confused. It starts forgetting the old characters to make room for the new ones (a problem called "catastrophic forgetting"). It's like trying to learn a new language by cramming; you might remember the new words, but you start forgetting the old ones. Also, if the handwriting is messy, the brain gets stuck because it expects every "A" to look exactly the same.

The New Way: AMR-CCR (The "Smart Index" System)

The authors of this paper propose a completely different approach called AMR-CCR. Instead of trying to memorize everything in a single brain, they build a dynamic, searchable dictionary.

Here is how it works, using simple analogies:

1. The "Universal Translator" (Shared Embedding Space)

Imagine a universal translator that can turn any character, no matter the script or style, into a single, standardized "ID card" (an embedding).

  • Whether it's a rough stone carving or a smooth bamboo slip, the translator converts it into a unique code.
  • This allows the system to compare a new character against all the old ones instantly, just like checking a fingerprint against a database.

2. The "Script-Specific Glasses" (SIA + SAR)

This is the cleverest part. Different scripts (like Oracle Bone vs. Clerical Script) have different "flavors."

  • The Problem: If you look at an Oracle Bone character with "Clerical Script glasses," it looks weird and confusing.
  • The Solution: The system has a pair of smart glasses for each script.
    • SIA (The Glasses): When a new character arrives, the system puts on the specific glasses for that script to "calibrate" the view, making the character look normal to the database.
    • SAR (The Eye Doctor): Since the system doesn't always know which script a character is from just by looking at it, it has a tiny "Eye Doctor" module. This doctor quickly guesses which pair of glasses to put on before the character is checked against the dictionary.

3. The "Multi-Identity" Dictionary (Multi-Prototype)

In the old system, a character like "Dragon" had only one entry in the dictionary. If the new "Dragon" looked slightly different (maybe written by a left-handed person), the system might miss it.

  • The New Solution: The dictionary doesn't just have one entry for "Dragon." It has multiple entries (prototypes) for the same character, capturing all the different ways it can be written (e.g., "Dragon - Bold Style," "Dragon - Flowing Style," "Dragon - Ancient Style").
  • This ensures that no matter how messy or unique the handwriting is, there's a match waiting for it.

4. The "Zero-Shot" Magic (Reading the Unknown)

Sometimes, archaeologists find a character that has never been seen before. There is no picture of it in the database.

  • The Old Way: The system would just say, "I don't know."
  • The New Way: The system uses text descriptions. If the archaeologist says, "This character looks like a mountain and means 'high'," the system searches its dictionary for a character that matches that description, even if it has never seen a picture of that specific character before. It's like solving a mystery using clues rather than just matching photos.

The Result: EvoCON (The Training Ground)

To prove this works, the authors built a new training ground called EvoCON. It's a simulated library with six different stages of scripts, ranging from the oldest to the newest. They tested their system against the old methods, and the results were clear:

  • Old methods forgot the past and got confused by new styles.
  • AMR-CCR remembered everything, handled the messy handwriting perfectly, and could even guess the meaning of totally new characters using text clues.

In a Nutshell

Instead of trying to force a single brain to memorize an ever-changing, messy library, this paper suggests building a smart, adaptable filing system. It uses special glasses to understand different writing styles, keeps multiple copies of every character to handle variations, and uses text descriptions to solve mysteries of characters it has never seen before. It turns a chaotic, growing problem into a manageable, searchable treasure hunt.