CREward: A Type-Specific Creativity Reward Model

This paper introduces CREward, the first type-specific creativity reward model that evaluates and generates creative images across geometry, material, and texture axes by leveraging human benchmarks and large vision-language models to enable applications in assessment, explanation, and guided generation.

Original authors: Jiyeon Han, Ali Mahdavi-Amiri, Hao Zhang, Haedong Jeong

Published 2026-06-03
📖 4 min read☕ Coffee break read

Original authors: Jiyeon Han, Ali Mahdavi-Amiri, Hao Zhang, Haedong Jeong

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine you are an art teacher trying to grade a class of students who are drawing chairs. In the past, you might have just given them a single score like "8 out of 10" for "creativity." But that's a bit vague. Did they get an 8 because the chair looked weird? Because it was made of jelly? Or because it had a cool, swirly pattern?

This paper introduces CREward, a new tool that acts like a super-smart art teacher who doesn't just give a single grade. Instead, it breaks "creativity" down into three specific, understandable ingredients: Geometry (the shape), Material (what it's made of), and Texture (the surface details).

Here is how the paper explains this system, using simple analogies:

1. The Problem: Creativity is Too Complicated

The authors argue that treating creativity as one big, blurry blob is "naive." Just like a chef needs to know if a dish is too salty, too sweet, or too spicy to fix it, AI designers need to know exactly what makes an image creative. Is the chair creative because it has a weird shape (Geometry), or because it looks like it's made of glass (Material)?

2. The Solution: A Three-Lens Camera

The researchers built a model called CREward that looks at images through three specific lenses:

  • Geometry: The overall shape and curves. (Think: Is the chair a perfect cube, or does it twist like a corkscrew?)
  • Material: How light bounces off it. (Think: Is it shiny metal, soft velvet, or squishy jelly?)
  • Texture: The fine details on the surface. (Think: Is it smooth, rough, or covered in polka dots?)

3. How They Taught the AI (The "Human vs. Robot" Test)

To teach CREward, the team first asked real human design experts to look at pairs of images and vote on which one was more creative for each of the three categories. This was their "gold standard."

Then, they asked a very smart AI (called an LVLM, or Large Vision-Language Model) to do the same voting. They found something surprising: The AI agreed with the humans almost perfectly. In fact, for some categories, the AI agreed with humans even better than the humans agreed with each other!

Because the AI was so good at mimicking human taste, the researchers used the AI's votes to train CREward. This is like hiring a robot assistant to grade thousands of homework assignments so the teacher doesn't have to do it manually.

4. What CREward Can Do (The "Magic Tools")

Once trained, CREward becomes a Swiss Army knife for creative AI:

  • The Scorecard: It can look at any image and give it three separate scores (one for shape, one for material, one for texture). This helps researchers see which AI models are actually good at making creative shapes versus just making pretty textures.
  • The Filter: Imagine you have a bucket of 1,000 AI-generated chairs. CREward can instantly sort them, pulling out the top 10 that have the wildest shapes or the most interesting materials. This helps human designers find inspiration quickly without scrolling through endless boring images.
  • The Slider: This is the coolest part. The researchers attached "sliders" to the AI generator. If you slide the "Geometry" bar up, the AI starts making chairs with stranger shapes. If you slide "Material" up, it starts making chairs out of weird stuff like water or gold. It's like having a remote control for creativity.
  • The X-Ray: Using a technique called Grad-CAM, CREward can highlight exactly where in the image the creativity is happening. If a chair is creative because of its twisted legs, the tool will highlight the legs, showing you exactly what the AI is focusing on.

5. The Catch (Limitations)

The authors are honest about the flaws.

  • Quality vs. Weirdness: CREward is great at spotting "newness," but it might sometimes think a blurry, nonsensical image is creative just because it's weird. It needs a partner to check if the image actually looks good (quality) before celebrating its creativity.
  • The "Spillover" Effect: Sometimes, when you try to make the shape more creative, the material accidentally changes too. The authors call this "entanglement." It's like trying to turn up the volume on a radio, but the bass gets louder too. They plan to fix this in the future.

Summary

In short, CREward is a new system that stops treating creativity as a mystery. It breaks it down into Shape, Stuff, and Surface, uses a smart AI to learn what humans find creative, and then gives us tools to measure, filter, and control exactly how creative our AI-generated art becomes.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →