Bilateral Trade Under Heavy-Tailed Valuations: Minimax Regret with Infinite Variance

This paper establishes the exact minimax regret rate for contextual bilateral trade under heavy-tailed valuations with infinite variance by extending self-bounding properties to real-valued settings and combining them with truncated-mean estimation to derive an epoch-based algorithm that interpolates between classical nonparametric and trivial linear rates.

Hangyi Zhao

Published Tue, 10 Ma
📖 5 min read🧠 Deep dive

Imagine you are a matchmaker running a bustling marketplace. Every day, a buyer and a seller walk in, each with a secret number in their head representing how much they value an item. Your job is to suggest a price.

  • If your price is too low, the seller walks away, and you miss a sale.
  • If your price is too high, the buyer walks away, and you miss a sale.
  • If your price is just right (between their two secret numbers), the deal happens, and you earn a small "gain" (the difference between their values).

Your goal is to learn the "true value" of the item based on clues (like the weather, the time of day, or the buyer's mood) so you can set the perfect price every time. The "Regret" is simply the total amount of money you lost because you didn't guess the price perfectly.

The Problem: The "Wild" Market

In most previous studies, economists assumed that these secret values were "well-behaved." They assumed that while the numbers might wiggle around, they wouldn't go crazy. Mathematically, this meant the "variance" (how wildly they jump) was finite.

But in the real world—think stock markets, insurance claims, or rare real estate deals—values can be wild. Sometimes, a single event causes a price to jump 100 times higher than usual. These are called Heavy-Tailed distributions. In these markets, the "average" jump is huge, and the "variance" is actually infinite.

Previous algorithms for matchmakers relied on calculating a simple average to learn the true value. But if the data has infinite variance, a simple average is useless; it gets thrown off by one crazy outlier and never settles down.

The Paper's Solution: A New Way to Learn

This paper, by Hangyi Zhao, solves the problem of how to learn in these "wild" markets. Here is the breakdown of their three big ideas, explained simply:

1. The "Self-Bounding" Safety Net

The authors first proved a surprising fact: Even if the market is crazy, your mistake doesn't grow as fast as you think.

Imagine you are trying to hit a bullseye. In a normal market, if you miss by 1 inch, you lose a little money. If you miss by 2 inches, you lose 4 times as much. This is called a squared relationship.
The authors proved that even in this wild, heavy-tailed market, if you guess the price wrong, your "Regret" (lost money) still grows only with the square of your error.

  • Why this matters: It means you don't need to know the exact value to do well. You just need to get close. This turns a hard "guessing the exact number" problem into an easier "estimating the average" problem.

2. The "Truncated Mean" Filter

Since you can't use a normal average (because one crazy outlier ruins it), the authors suggest a clever trick: The Truncated Mean.

Imagine you are listening to a choir. One singer is screaming at the top of their lungs (an outlier).

  • Old Method: You try to calculate the average pitch of the whole choir. The screaming singer makes the average sound terrible.
  • New Method: You decide, "I will ignore anyone singing louder than a certain volume." You cut off the extreme screams. Then, you average the remaining voices.

In the paper, they use a mathematical version of this "cutting off" (truncation). They ignore the most extreme price jumps when calculating the average. This allows them to find the true center of the market even when the data is chaotic.

3. The "Epoch" Strategy

Instead of trying to learn and guess every single second, the matchmaker works in Epochs (rounds of learning).

  • Round 1: Guess randomly.
  • Round 2: Look at the data from Round 1, use the "Truncated Mean" to filter out the noise, and make a better guess for Round 2.
  • Round 3: Use data from Round 2 to refine the guess for Round 3.

By doubling the size of these learning rounds, they prove that the matchmaker gets smarter very quickly, even with infinite variance.

The Results: How Fast Can You Learn?

The paper calculates the exact speed at which you can learn in this wild market.

  • If the market is "normal" (finite variance): You learn at a standard, fast pace.
  • If the market is "wild" (infinite variance): You learn slower, but not impossibly slow. The paper gives a precise formula showing that as the market gets "wilder" (heavier tails), your learning speed slows down, but it never stops.

They also proved that you cannot do better than this speed. No matter how clever your algorithm is, if the market is this wild, you cannot learn faster than their method allows.

The Big Picture Analogy

Think of navigating a ship in a storm.

  • Old Theory: The waves are small and predictable. You can use a standard compass (Average) to steer.
  • This Paper: The waves are tsunamis. One giant wave can knock your compass off.
  • The Solution: The authors built a storm-proof compass. It ignores the giant, freak waves (Truncated Mean) and focuses on the general direction of the smaller waves. They proved that even in a hurricane, you can still steer your ship to the destination, and they calculated exactly how long the journey will take.

In short: This paper teaches us how to make smart decisions in chaotic, unpredictable markets by ignoring the extreme outliers and focusing on the reliable patterns, proving that even in the wildest financial storms, we can still find a path to efficiency.