Iconographic Classification and Content-Based Recommendation for Digitized Artworks

This paper presents a proof-of-concept system that automates the classification and recommendation of digitized artworks by integrating YOLOv8 object detection with the Iconclass vocabulary and multiple recommendation algorithms to accelerate cataloging and enhance navigation in heritage repositories.

Krzysztof Kutt, Maciej Baczyński

Published 2026-02-24
📖 5 min read🧠 Deep dive

Imagine you walk into a massive, dusty library containing millions of paintings. You want to find a picture of a "sad dog," but the librarians (the museum experts) are too busy to look through every single painting to tell you what's inside. They only wrote down basic notes like "17th century," "oil on canvas," and "artist's name." You're stuck.

This paper introduces a new digital assistant called CARIS (Classification and Recommendation for the Iconclass System) designed to solve this problem. It acts like a super-smart, tireless intern who can look at a painting, figure out exactly what's happening in the story, and then find other paintings with similar stories for you.

Here is how the system works, broken down into simple steps with some creative analogies:

1. The "Eagle-Eyed" Detective (Object Detection)

First, the system needs to know what is actually in the picture. It uses a tool called YOLO (You Only Look Once), which is like a super-fast security camera that scans a painting and shouts out everything it sees: "I see a horse! I see a human! I see a dog!"

  • The Analogy: Think of YOLO as a very fast but slightly literal-minded child. If you show it a picture of a dog, it says "Dog." It doesn't yet know if the dog is a hero, a villain, or just a pet. It just sees the physical object.

2. The "Dictionary Translator" (Mapping to Iconclass)

Now, the system has a list of objects ("dog," "horse"), but museums don't use simple words; they use a massive, complex code system called Iconclass. This is like a giant, organized filing system where every possible theme in art has a specific code (e.g., 34B11 for "dog").

The system tries to translate the child's list ("dog") into the museum's code.

  • The Challenge: The word "dog" is tricky. In art, a dog might mean "loyalty," "hunting," or a specific story about Hercules.
  • The Solution: The system uses a "three-pass" strategy. First, it looks for an exact match. If that fails, it looks for partial matches. Finally, it checks if the dog appears in specific famous stories. It's like a translator who first tries a direct dictionary definition, then checks a thesaurus, and finally asks a local expert for context.

3. The "Story Detective" (Inferring Abstract Meanings)

Sometimes, the painting isn't just about objects; it's about a concept like "Justice" or "Hunting." You can't take a photo of "Justice," but you can take a photo of a woman holding scales and a sword.

  • The Analogy: The system acts like a logic puzzle solver. If it sees a blindfolded woman + scales + sword, it doesn't just say "woman, scales, sword." It connects the dots and says, "Aha! This is the code for Justice!"
  • It uses a set of simple rules (like a recipe book) to combine detected objects into deeper meanings.

4. The "Book Club Matchmaker" (Recommendation)

Once the system has the correct codes for your painting, it needs to find other paintings you might like. It uses three different "matchmaking" strategies:

  1. The Family Tree (Hierarchy): If you like a painting about "Hercules," this method suggests other paintings about "Greek Heroes" or "Mythology," even if they aren't exactly Hercules. It understands that they are cousins in the art family tree.
  2. The Rare Gem Finder (IDF): Some codes are very common (like "sky" or "tree"). Others are rare and specific (like "Hercules biting a snail"). This method gives extra weight to the rare codes. If you have a painting with a rare code, it will prioritize finding other paintings with that same rare code, ignoring the boring common ones.
  3. The Perfect Overlap (Jaccard): This method looks at the whole list of codes. It asks, "How much of this list is shared with that painting?" It's great for finding paintings that are thematically very similar, avoiding suggestions that are just vaguely related.

Why is this important?

Currently, finding specific types of art in huge digital archives is like finding a needle in a haystack. You have to guess the right keywords.

This system changes the game by:

  • Automating the boring work: It does the initial "guessing" of what's in the picture so human experts can focus on the hard stuff.
  • Understanding the story: It doesn't just see pixels; it understands the meaning behind the pixels using the Iconclass system.
  • Connecting the dots: It helps you discover art you didn't know you wanted to see, based on the stories the art tells, not just how it looks.

The Catch (Limitations)

The authors are honest: the system is a "proof of concept," meaning it's a working prototype, not a finished product yet.

  • It's only as good as its eyes: If the "detective" (YOLO) mistakes a cow for a horse, the whole story gets wrong.
  • It needs training: The system needs to be taught more specific art terms to avoid getting confused by similar-looking animals or objects.

The Bottom Line

This paper presents a bridge between Artificial Intelligence (which sees shapes) and Human Culture (which understands stories). By combining a computer's ability to spot objects with a structured system of art history codes, CARIS helps us navigate the vast ocean of human creativity without getting lost. It's not replacing the museum curator; it's giving them a super-powered assistant to help everyone else find the treasure hidden in the collection.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →