YC Bench: a Live Benchmark for Forecasting Startup Outperformance in Y Combinator Batches

This paper introduces "YC Bench," a live benchmark using the Y Combinator W26 batch to enable rapid, short-term forecasting of startup outperformance by measuring pre-Demo Day traction signals, thereby reducing the traditional multi-year evaluation cycle to just a few months.

Mostapha Benhenda

Published 2026-04-06
📖 5 min read🧠 Deep dive

Imagine you are a talent scout trying to find the next big superstar in a room full of 200 aspiring actors. The problem? You won't know who actually becomes a movie star for another 7 to 10 years. By then, you've missed the boat, and you can't learn from your mistakes quickly.

This is exactly the problem investors face with startups. They want to predict which companies will become the next Google or Uber, but the "proof" (like a massive sale or huge profits) takes a decade to show up.

Enter "YC Bench."

Think of YC Bench as a "speed-run" version of the startup world. It's a new tool created by Mostapha Benhenda to test how well we can predict startup success, but instead of waiting 10 years, we wait just three months.

Here is the simple breakdown of how it works, using some everyday analogies:

1. The Setting: The "Talent Show" (Y Combinator)

Y Combinator (YC) is like a massive, high-stakes talent show. Every few months, they invite about 200 startups to join a "batch."

  • The Catch: They all start at the same time, get the same coaching, and have to perform on the same day (called "Demo Day").
  • The Opportunity: Because they are all in the same room, starting at the same time, we can compare them fairly. It's like a 100-meter dash where everyone starts on the same line.

2. The Problem: The "Crystal Ball" is Broken

Usually, investors try to guess who will win the race by looking at the finish line (exits, IPOs). But the finish line is 7 years away.

  • The Solution: YC Bench says, "Let's not wait for the finish line. Let's see who is running the fastest right now, three months in."
  • The Metric: They created a "Pre-Demo Day Score." This is a single number that tries to guess who is the "fastest runner" before the race is even over.

3. How They Score the Runners

To calculate this score, they look at two things, like a coach judging a runner:

  • The "Traction" Score (The Hard Data):
    If a startup has already made money or has thousands of users, they get points. It's like a runner who has already won local races.

    • Analogy: If a startup says, "We have $25,000 in sales," that's a huge point. If they just have a list of people who might buy, that's fewer points.
    • The Twist: They also look at speed. If a company's sales are doubling every month, they get a massive "momentum bonus," like a runner who is sprinting faster than everyone else.
  • The "Attention" Score (The Hype):
    Most startups don't have sales data yet. So, the system looks at how much people are talking about them on Google.

    • Analogy: If a startup has no money yet, but 5,000 people are searching for their name on Google, that's like a runner who has a huge fan club cheering them on. The system assumes: "If people are searching for them, they might be onto something."

The Final Score: The system takes the better of the two numbers. If you have money, you win. If you don't have money but you are famous, you still win.

4. The Experiment: The "Google Guess"

The author tested a simple idea: "Can we guess the winners just by looking at who was famous before the race even started?"

  • The Setup: They looked at how many times people Googled the startups' names before they even applied to Y Combinator (months before the race started).
  • The Result: This simple "Google Guess" was surprisingly good!
    • It correctly identified 55% of the top performers (the ones who would have the most traction by Demo Day).
    • It was 2.75 times better than just guessing randomly.

What does this mean?
It means that even before a startup gets into a famous accelerator, the "hype" and visibility they already have on the internet are strong clues about their future success.

5. Why This Matters

Think of YC Bench as a video game training level for investors and AI researchers.

  • Before: Investors had to play the "real game" and wait 10 years to see if their strategy worked. If they lost, they couldn't learn because the game was over.
  • Now: With YC Bench, they can play a "training level" that lasts only 3 months. They can test their prediction models, see if they work, fix their mistakes, and try again immediately.

The Bottom Line

This paper introduces a new way to test our ability to predict startup success. Instead of waiting a decade for the results, we can use a clever mix of "sales data" and "Google search volume" to get a reliable prediction in just a few months.

It turns the slow, painful process of investing into a fast, iterative learning loop, helping us find the "superstars" much earlier in the game.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →