NOTAI.AI: Explainable Detection of Machine-Generated Text via Curvature and Feature Attribution

The paper introduces NOTAI.AI, an explainable framework that combines curvature-based signals, neural, and stylometric features within an XGBoost classifier to detect machine-generated text while using SHAP and an LLM layer to generate structured, natural-language rationales for its decisions.

Oleksandr Marchenko Breneur, Adelaide Danilov, Aria Nourbakhsh, Salima Lamsiyah

Published Mon, 09 Ma
📖 4 min read☕ Coffee break read

Imagine you are a detective trying to solve a mystery: Did a human write this story, or did a robot (an AI) write it?

In the past, robots were easy to spot because they sounded stiff or made weird grammar mistakes. But today's AI is like a master forger; it writes so smoothly and naturally that even experts get fooled.

The paper introduces NOTAI.AI, a new "super-detective" tool designed to catch these AI forgers. But unlike old tools that just give you a suspicious "yes/no" score, NOTAI.AI acts like a transparent detective who shows you exactly why they think a text is fake, using simple language anyone can understand.

Here is how it works, broken down with some everyday analogies:

1. The Three Eyes of the Detective

Most detectors look at a text with just one pair of eyes. NOTAI.AI puts on three different pairs of glasses to look at the same text from different angles. It combines these three views to make a final decision:

  • The "Smoothness" Glasses (Curvature):
    Imagine a human writing a story is like walking through a forest. You might step on a rock, trip over a root, or take a sudden turn. It's a bit bumpy and unpredictable.
    AI, however, is like a robot on a perfectly paved highway. It takes the most direct, "smoothest" path every time.

    • The Trick: NOTAI.AI measures how "bumpy" the text is. If the text is too smooth and predictable (like a highway), it suspects a robot. If it has natural bumps and irregularities, it suspects a human.
  • The "Vocabulary" Glasses (Stylometrics):
    This looks at the "fingerprint" of the writing.

    • Repetition: Humans get bored and use different words. AI sometimes gets stuck in a loop, repeating the same phrases or sentence structures.
    • Word Choice: Humans might use rare, weird words (like "hapax legomena" – words you only see once). AI tends to stick to the most common, safe words.
    • Punctuation: How many commas or periods are there? Humans have a unique rhythm; AI often has a robotic, uniform rhythm.
  • The "Brain" Glasses (Neural Signals):
    This is a super-smart AI (called ModernBERT) that has read millions of books. It acts like a literary critic. It asks, "Does this sentence feel like something a human would say, or does it feel like something a computer would generate?" It gives a "gut feeling" score based on deep patterns.

2. The "Super-Brain" (The Meta-Classifier)

Once the three pairs of glasses gather their clues, they pass the notes to a Chief Detective (an algorithm called XGBoost).

  • The Chief Detective doesn't just listen to one clue. It weighs them all together.
  • Example: "The text is very smooth (AI clue), but it uses a very rare word (Human clue). Let's look at the punctuation... ah, the punctuation is too perfect. I'm 96% sure this is AI."

3. The "Translator" (Explainability)

This is the most important part. Old detectors would just say, "95% Chance: AI." That's not helpful if you are a teacher or a journalist trying to prove it.

NOTAI.AI has a Translator (an LLM) that turns the math into a story.

  • Instead of: "SHAP value for Feature 4 is 0.45."
  • It says: "We think this is AI because the text is unusually smooth, like a highway, and it repeats the phrase 'in conclusion' three times, which is a common robot habit."

It gives you a structured report with a summary and the top 3 reasons why, written in plain English.

4. The "Playground" (Interactive Demo)

The authors built a website where you can play with the tool.

  • The "What-If" Game: You can turn off specific clues. For example, you can tell the detective, "Ignore the smoothness test, just look at the vocabulary."
  • The Result: You can watch the detective change its mind in real-time. This helps you understand which clues matter the most.

Why Does This Matter?

In a world where AI can write essays, news articles, and emails that look perfect, we need tools that don't just guess, but explain.

  • For Teachers: It helps spot AI essays without accusing a student unfairly.
  • For Journalists: It helps verify if a news story was written by a human or a bot farm.
  • For Everyone: It builds trust. You aren't just told the answer; you are shown the evidence.

In short: NOTAI.AI is like a detective that doesn't just point a finger; it hands you a magnifying glass, points out the specific footprints and fingerprints, and explains the whole story in a way that makes perfect sense.