PriorWeaver: Prior Elicitation via Iterative Dataset Construction

PriorWeaver is an interactive visualization system that simplifies Bayesian prior elicitation for novice analysts by allowing them to iteratively construct and refine datasets reflecting their beliefs, resulting in priors that are more aligned with their expectations compared to traditional methods.

Yuwei Xiao, Shuai Ma, Antti Oulasvirta, Eunice Jun

Published Mon, 09 Ma
📖 4 min read☕ Coffee break read

Imagine you are a chef trying to teach a robot how to cook your famous soup.

In the old way of doing things (traditional statistics), you would have to speak the robot's language: "Add exactly 3.14 grams of salt with a variance of 0.5." You'd have to translate your gut feeling ("It needs to be salty, but not too salty") into complex math formulas. If you get the math wrong, the robot makes a disaster, and you have no idea why.

PriorWeaver is a new tool that changes the game. Instead of speaking math, it lets you speak reality.

Here is how it works, using simple analogies:

1. The Problem: The "Translator" Gap

In Bayesian analysis (a way of learning from data), you need to tell the computer what you already believe before you start looking at new data. This is called "eliciting a prior."

  • The Old Way: You have to guess abstract numbers (like "the slope of the line should be 0.7"). It's like trying to describe a color to someone who only understands hex codes. Most people aren't fluent in "hex code," so they guess, get it wrong, and the robot makes weird soup.
  • The New Way (PriorWeaver): You don't guess numbers. You build a fake dataset that looks exactly like the world you know.

2. The Solution: Building a "Dream World"

PriorWeaver treats the process like building a Lego city or a cast of characters for a movie.

  • Step 1: The Ingredients (The Histograms)
    Imagine you are thinking about "Age" and "Income." Instead of typing "Average age is 40," you click on a chart to add dots. You add a dot for a 25-year-old, a dot for a 60-year-old, and a bunch of dots for 40-year-olds. You are literally drawing the shape of the population in your head.

    • Analogy: You are filling a bucket with marbles of different sizes to represent the people you know.
  • Step 2: The Connections (The Scatterplots & Parallel Coordinates)
    Now, you need to connect the dots. You know that older people usually have more education, but maybe not the highest income.

    • In PriorWeaver, you can "brush" (select) a group of dots. You can say, "Take these 50-year-olds with high education and give them a moderate income."
    • Analogy: It's like casting actors for a play. You pick a 50-year-old actor, give them a script that says "High Education," and assign them a salary of $60k. You do this for hundreds of imaginary people until your "cast" perfectly matches your real-world experience.

3. The Magic: The "Reality Check"

Once you have built your "Dream World" (your dataset of imaginary people), you hit a button called "Translate."

  • The Translation: The computer takes your imaginary people, runs the math behind the scenes, and says, "Okay, based on your cast, here is what the future data should look like."
  • The Feedback: It shows you a graph of the results.
    • Scenario: You built a world where everyone is rich. The computer says, "Your model predicts everyone will make $1 million."
    • Your Reaction: "Whoa, that's too high! I didn't mean everyone to be rich."
    • The Fix: You go back to your "Dream World," delete a few of the super-rich imaginary people, and add a few average earners. You hit "Translate" again.
    • Result: Now the prediction looks realistic.

Why This Matters

The paper tested this with 17 people who knew statistics but had never used Bayesian methods before.

  • Without PriorWeaver: They felt like they were guessing in the dark. They had to do mental math to convert "I think rich people are rare" into "The parameter must be 0.04." They felt confused and unsure.
  • With PriorWeaver: They felt like directors. They could see their beliefs, touch them, and tweak them.
    • They said it felt "intuitive" because they were just adding examples they knew from real life.
    • They felt confident because they could see exactly why the computer was making a weird prediction (e.g., "Oh, I added too many high-income 20-year-olds").

The Big Takeaway

PriorWeaver turns a scary math problem into a creative design task.

Instead of asking you to be a mathematician, it asks you to be a storyteller. It says, "Don't tell me the numbers. Just show me the story of the world you believe in." Once you've told the story, the computer handles the heavy lifting of the math.

This makes a complex scientific method accessible to anyone who can imagine a scenario, opening the door for more people to use powerful tools to understand the world.