Design of Bayesian Clinical Trials with Clustered Data

This paper proposes an efficient method for assessing operating characteristics and determining sample sizes in Bayesian clinical trials with clustered data by proving theoretical results that model posterior probabilities as a function of cluster counts, thereby eliminating the need for computationally intensive Monte Carlo simulations across multiple design configurations.

Luke Hagar, Shirin Golchi

Published 2026-03-17
📖 5 min read🧠 Deep dive

Imagine you are a chef planning a massive banquet for a new community. You want to test a new recipe (a new medicine) against the old standard recipe to see if the new one is just as safe.

But here's the catch: You aren't cooking for individuals one by one. You are cooking for groups (like families or households). If one family member gets sick from the food, the whole family might react similarly because they share the same kitchen, the same ingredients, and the same environment. In statistics, we call this clustered data.

This paper, written by Luke Hagar and Shirin Golchi, tackles a very expensive and time-consuming problem: How do you figure out how many groups (clusters) you need to invite to your banquet to be sure your new recipe works, without actually cooking the whole meal a thousand times?

Here is the breakdown using simple analogies:

1. The Problem: The "Taste-Test" Bottleneck

In the past, to design a clinical trial, statisticians had to play a game of "What If?" thousands of times on a computer.

  • The Old Way: They would simulate the trial with 100 groups, then 101, then 102, all the way up to 200. For each number, they had to run a complex computer simulation (like a high-end video game) to see if the new medicine looked safe.
  • The Pain: This is like trying to find the perfect amount of salt in a soup by cooking the entire pot from scratch, tasting it, throwing it away, and starting over again for every single pinch of salt you want to test. It takes forever and costs a fortune in computing power.

2. The Solution: The "Two-Point" Shortcut

The authors discovered a mathematical magic trick. They realized that if you look at the results of your "taste tests" (the computer simulations) at just two specific group sizes, you can draw a straight line through them and predict the results for any other group size in between.

  • The Analogy: Imagine you are trying to guess how tall a tree will be in 10 years.
    • The Old Way: You measure the tree every single day for 10 years.
    • The New Way: You measure the tree today (at 100 groups) and measure it again in 4 years (at 140 groups). Because trees grow in a predictable, steady way, you can draw a straight line between those two points and accurately guess how tall it will be at 115 groups, 120 groups, or 130 groups. You don't need to wait or measure the middle points.

3. The "Magic Line" (The Theory)

The paper proves mathematically that for these types of group-based trials, the relationship between the number of groups and the "safety score" of the medicine is almost a straight line.

  • The Logit Line: They use a specific mathematical curve (called a "logit") that turns the messy, wiggly probability numbers into a straight line.
  • The Result: Instead of running 10,000 simulations to check 100 different group sizes, they only need to run simulations for two sizes (e.g., 100 groups and 140 groups). They draw the line, and boom—they know the answer for every number in between.

4. Why This Matters (The "Banquet" Outcome)

In the real world, this saves a massive amount of time and money.

  • Speed: In their example, calculating the results for a whole range of group sizes used to take 35 minutes of heavy computer time. With their new method, it took only 8 minutes.
  • Accuracy: They showed that their "straight line" guess was almost identical to the "cook everything from scratch" method.
  • Confidence: They also built a "safety net" (called a bootstrap confidence interval) to tell the researchers, "We are 95% sure the right number of groups is between 114 and 116."

5. The Real-World Example: Tuberculosis

The authors tested this on a real-world scenario involving a trial for Tuberculosis (TB) prevention.

  • The Setup: Families (clusters) were given either a new TB drug or the old one.
  • The Goal: To prove the new drug was "non-inferior" (just as safe) as the old one.
  • The Challenge: Because families share environments, their health outcomes are linked. This makes the math very hard.
  • The Win: Using their shortcut, they quickly determined that they needed about 115 to 129 families (depending on how much the families influenced each other) to be confident in the results. Without this method, designing this trial would have been a computational nightmare.

Summary

This paper is like giving a chef a predictive ruler. Instead of tasting the soup 100 times to find the perfect recipe, the chef tastes it twice, draws a line, and knows exactly how much salt to add for any size crowd. It makes designing medical trials faster, cheaper, and more efficient, ensuring that new medicines can be tested and approved without wasting resources.