Measuring onion website discovery and Tor users' interests with honeypots

This paper presents a honeypot-based study conducted in early 2025 that reveals human Tor users primarily discover onion sites via the Ahmia search engine and exhibit significantly higher engagement with child sexual abuse material (CSAM) compared to other illicit categories, with English-language versions attracting the most interaction.

Arttu Paju, Waris Abdullah, Juha Nurmi

Published Wed, 11 Ma
📖 5 min read🧠 Deep dive

Imagine the dark web (specifically the Tor network) as a massive, foggy city where everyone wears a mask, and the streets are lined with millions of hidden doors. Most people think they know what's behind those doors because they've read reports about the "shops" selling illegal drugs or weapons. But this study asks a different question: What are the actual people walking around this city trying to open?

To find out, the researchers didn't just look at the doors; they built traps (called "honeypots") that looked exactly like the things people were rumored to want.

Here is the story of their experiment, broken down simply:

1. The Setup: Building Fake "Trap Doors"

The researchers built 32 fake websites. They looked like real forums for all sorts of illegal activities:

  • Child abuse material (CSAM)
  • Violence
  • Malware (computer viruses)
  • Stolen goods
  • Illegal guns
  • Illegal drugs
  • Counterfeit items
  • A "mystery" forum (the control group)

They made these sites in four languages: English, German, Finnish, and Russian.

The Bouncer: To make sure they were talking to real humans and not just computer robots, they put a CAPTCHA (those "click all the traffic lights" puzzles) in front of the door. If you couldn't solve the puzzle, you couldn't get in.

2. The Bait: How Did People Find the Traps?

The researchers needed to see how people found these hidden doors. They used three different "signposts" to lead people to their traps:

  1. Ahmia: A search engine for the dark web (like Google, but for the hidden city).
  2. Paste sites: Websites where people paste links (like a digital bulletin board).

They dropped links to their traps in these three places and waited to see who walked in.

3. The Big Surprise: The Search Engine vs. The Bulletin Board

The Result: It was a landslide.

  • Ahmia (The Search Engine): Almost every single real human came from here. When the researchers removed the links from the search engine, the traffic from real humans dropped to almost zero.
  • Paste Sites: These sites were flooded with traffic, but it was almost entirely robots and automated bots. When the researchers looked at who actually solved the CAPTCHA and tried to log in, the paste sites had zero real human activity.

The Analogy: Imagine you put up a flyer for a secret party.

  • Ahmia is like a person walking down the street reading the flyer and actually showing up.
  • Paste sites are like a wind machine blowing thousands of copies of the flyer around. It looks busy, but no one is actually there; it's just paper flying in the wind.

4. The "What They Wanted" Shock

Once the humans got past the CAPTCHA, they saw the "menu" of fake illegal activities. The researchers counted how many people tried to create an account or log in for each topic.

The Ranking of Interest:

  1. 🥇 Child Sexual Abuse Material (CSAM): This was the most popular category by a huge margin. More than double the interest of the next category.
  2. 🥈 Violence: The second most popular.
  3. 🥉 Malware & Stolen Goods: Moderate interest.
  4. 📉 Illegal Drugs: Surprisingly, this was very low on the list.

Why is this weird?
Most news stories and studies focus on the dark web as a giant "drug market." But this study suggests that for the average person stumbling onto the dark web via a search engine, drugs aren't the main attraction. They are more interested in the disturbing, violent, or abusive content.

Note: The researchers suspect that people looking for drugs already know exactly which "shops" to go to and don't need to search for them. But people looking for the other, more disturbing content might be using search engines to find it.

5. The Language Barrier

When the researchers offered the same traps in different languages, English was the clear winner.

  • English: The most popular.
  • German & Finnish: Second and third.
  • Russian: Surprisingly, the least popular, even though Russian is a huge language on the dark web.

The "Finnish" Twist: The study was done by researchers in Finland, and the search engine (Ahmia) is Finnish. This likely explains why Finnish was so popular—it's the "home turf" of the search engine.

The Takeaway

This study is like a mood ring for the dark web. Instead of guessing what people want based on what's available, they measured what people actually tried to access.

  • Main Finding: Real humans mostly find the dark web through search engines, not by pasting links.
  • Biggest Shock: The content that draws the most human curiosity isn't drugs or weapons; it's the most disturbing, abusive material.
  • Language: English speakers are the most active group in this specific "search engine" crowd.

The researchers emphasize that this only shows the behavior of people using a specific, filtered search engine. It doesn't represent everyone on the dark web, but it gives us a very clear picture of what happens when a regular person types a query and clicks a link.