Better Measurement or Larger Samples? Data Collection for Policy Learning with Unobserved Heterogeneity

This paper derives regret bounds and minimax optimal data collection strategies for policymakers facing unobserved heterogeneity, demonstrating through a development economics application that targeting based on latent traits improves welfare and that the optimal resource allocation balances improving measurement precision against increasing sample size.

Giacomo Opocher

Published 2026-04-09
📖 4 min read☕ Coffee break read

Imagine you are a mayor trying to decide who should receive a special grant to help start a small business. You have a limited budget, and you want to give the money to the people who will use it best to make a profit.

This paper is about a tricky dilemma the mayor faces: Should you spend your money gathering more detailed information about a few people, or should you spend it on a larger list of people with less detailed information?

Here is the story of the paper, broken down into simple concepts.

1. The Hidden "Superpower" (Unobserved Heterogeneity)

In the real world, people are different. Some have a "hidden superpower" (like natural business talent, high motivation, or grit) that you can't see just by looking at their age or education.

  • The Old Way: Policymakers usually look only at what they can see (age, education, income). They guess who will succeed based on these visible traits.
  • The New Idea: What if we could measure that "hidden superpower"? For example, asking neighbors to rank each other's business skills. If we use this ranking to decide who gets the money, we might do a better job.

2. The Problem: The "Blurry Photo" vs. The "Big Group"

The author, Giacomo Opocher, points out two big problems with using these hidden traits:

  1. The Blurry Photo (Measurement Error): You can't measure "business talent" perfectly. Asking neighbors for a ranking is like taking a photo with a slightly blurry lens. It helps, but it's not 100% accurate. If the photo is too blurry, you might give money to the wrong person.
  2. The Big Group (Sample Size): You have a fixed budget. If you spend a lot of money getting very clear photos (asking 5 neighbors to rank everyone), you have less money left to actually talk to a large number of people. If you talk to fewer people, your data is shaky, and you might make a bad decision just because you didn't have enough examples.

The Trade-off: Do you buy a super-clear photo of 100 people, or a slightly blurry photo of 1,000 people?

3. The Solution: Finding the "Sweet Spot"

The author created a mathematical formula (a "regret bound") to figure out the answer. Think of this formula as a GPS for budgeting.

  • When to focus on clarity: If the "hidden superpower" (business skill) is huge and makes a massive difference in who succeeds, it's worth spending money to get a clearer measurement, even if you have to talk to fewer people.
  • When to focus on quantity: If the "hidden superpower" doesn't matter that much, or if getting a clear measurement is incredibly expensive, it's better to ignore the hidden trait and just talk to as many people as possible using standard info (like age and education).

The paper proves that there is a specific "tipping point." If the hidden trait is important enough, the blurry photo is still better than no photo at all, but you have to balance how many photos you take.

4. The Real-World Test: The Indian Market Experiment

To prove this works, the author looked at a real experiment in rural India where micro-entrepreneurs were given cash grants.

  • The Setup: Researchers asked entrepreneurs to rank their peers on business skills. This ranking was the "proxy" for the hidden talent.
  • The Findings:
    • Using the rankings (the hidden trait) increased the total wealth generated by 5%.
    • It cut the chance of making a "bad decision" (giving money to someone who fails) in half.
    • The Budget Twist: The researchers simulated different budgets. They found that if the budget is tight, you shouldn't try to get the perfect ranking (asking 5 neighbors). Instead, you should ask fewer neighbors (maybe 2) and use the saved money to include more entrepreneurs in the study.
    • The Result: Even with a small budget, it was always better to use some ranking information than to ignore it completely. But the "perfect" amount of information changes depending on how much money you have.

The Big Takeaway

This paper tells policymakers: Don't just guess, and don't just collect data blindly.

If you want to help people effectively, you need to measure the things that really matter (like motivation or skill), even if your measurement isn't perfect. However, you must be smart about your budget. Sometimes, a "good enough" measurement of many people is better than a "perfect" measurement of a few. The author gives you the math to find that perfect balance so you can maximize the good you do with every dollar spent.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →