Restoration-Guided Kuzushiji Character Recognition Framework under Seal Interference

This paper proposes a three-stage restoration-guided framework (RG-KCR) that effectively mitigates seal interference in pre-modern Japanese documents by integrating character detection, image restoration, and classification, thereby significantly improving recognition accuracy compared to existing methods.

Rui-Yang Ju, Kohei Yamashita, Hirotaka Kameko, Shinsuke Mori

Published 2026-02-24
📖 4 min read☕ Coffee break read

Imagine you've found an ancient, dusty diary from 300 years ago. The handwriting is beautiful but messy—like a fast, flowing cursive script that nobody uses anymore. This is Kuzushiji, the cursive writing style of pre-modern Japan.

Now, imagine someone tried to read this diary, but every few pages, there's a giant, red stamp (a seal) slapped right over the words. In the old days, these stamps were like signatures or official seals of ownership, but today, they are the biggest headache for computers trying to read the text. They cover the letters, making them look like a messy red blob.

This paper introduces a new "super-reader" system called RG-KCR (Restoration-Guided Kuzushiji Character Recognition) that solves this specific problem. Here is how it works, broken down into three simple steps:

Step 1: The "Spotter" (Finding the Words)

First, the system needs to know where the words are, even if they are covered by red stamps.

  • The Analogy: Think of this like a game of "Where's Waldo?" but instead of finding Waldo, the computer is looking for individual Japanese characters.
  • What they did: They trained a very sharp-eyed AI (called YOLOv12) to hunt down every single character. Even if a red stamp is sitting on top of a character, this AI is smart enough to say, "Hey, there's a letter hiding under that red ink!" and draw a box around it.

Step 2: The "Magic Eraser" (Cleaning the Image)

Once the system knows where the characters are, it needs to clean them up before trying to read them.

  • The Analogy: Imagine you spilled red paint on a page of text. You wouldn't try to read the letters while the paint is still there. You'd use a magic eraser to gently lift the red paint off without smudging the black ink underneath.
  • What they did: They created a special, fast algorithm that acts like a digital magic eraser. It looks for pixels that are "too red" (the seals) and gently paints over them with the surrounding paper texture. It's like a painter filling in a hole in a wall so the picture looks whole again. Crucially, this step doesn't require the computer to "learn" anything new; it just uses math to spot the red and fix it instantly.

Step 3: The "Translator" (Reading the Clean Text)

Now that the red stamps are gone and the characters are clean, the system can finally read them.

  • The Analogy: This is like handing a clean, clear page to a master translator who knows thousands of ancient dialects.
  • What they did: They fed the cleaned-up character images into a powerful AI called Metom. Because the red stamps were removed in Step 2, Metom can now read the characters with much higher accuracy. It translates the ancient, messy cursive into modern, readable Japanese.

Why is this a big deal?

Before this paper, existing systems were like trying to read a book through a foggy, red-tinted window. They would often guess wrong or get confused when seals were present.

The authors tested their new three-step process and found:

  1. The Spotter found 98% of the characters correctly, even with seals on them.
  2. The Magic Eraser successfully removed the red stamps, making the text look almost like new.
  3. The Translator got much smarter after the cleaning step. Its accuracy jumped from about 93% to 95%.

The Final Result

The system doesn't just output a list of words; it takes the original ancient image, removes the red stamps, and overlays the modern Japanese translation right on top of the original text. It's like having a pair of glasses that instantly cleans up the red ink and whispers the meaning of the ancient words directly into your ear.

In short: They built a pipeline that finds the messy text, erases the annoying red stamps, and then reads the clean text, making history much easier for modern people to understand.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →