Online Order Fulfillment with Replenishment

This paper investigates the relative impact of inventory replenishment policies versus real-time online fulfillment algorithms on system profit under demand uncertainty, demonstrating that cumulative regret remains stable over long cycles and introducing a novel look-ahead algorithm that outperforms myopic baselines while identifying specific regimes where optimizing one lever yields greater revenue gains than the other.

Zi Ling, Jiashuo Jiang, Linwei Xin

Published 2026-03-05
📖 5 min read🧠 Deep dive

Imagine you are running a busy, high-end bakery. Every morning, you have to make two critical types of decisions:

  1. The Replenishment Decision: How much flour and sugar should you order from the wholesaler? (This takes time to arrive).
  2. The Fulfillment Decision: When a customer walks in and orders a cake, do you sell it now, or do you save the ingredients for a bigger order that might come later?

Most business schools teach you how to do one of these things perfectly, but rarely how to do both together. This paper asks a simple but profound question: Which of these two levers matters more for your profit? Is it better to have a perfect ordering system, or a perfect salesperson?

Here is the breakdown of their findings, using some everyday analogies.

1. The Two Worlds (The Problem)

Traditionally, researchers have studied these problems in isolation:

  • The "Ordering" Experts: They study how to order flour so you never run out, but they assume you just sell everything that comes in immediately. They don't worry about which customer gets the cake.
  • The "Sales" Experts: They study how to decide which customer gets the cake in real-time to make the most money, but they assume you magically have infinite flour or that your ordering is already perfect.

In the real world (like Amazon or a busy bakery), these two problems are tangled together. If you order too little flour, your fancy sales strategy doesn't matter because you have nothing to sell. If you order too much, you waste money on storage.

2. The Big Discovery: The "Long Line" vs. The "Short Line"

The authors ran a massive simulation to see which lever (Ordering vs. Selling) pulls the most weight. They found that the answer depends entirely on how often you restock.

Scenario A: The "Long Line" (Infrequent Restocking)

Imagine you order flour only once a month.

  • The Finding: In this case, how you sell matters most.
  • The Analogy: Think of your inventory as a single tank of gas for a long road trip. Once you fill the tank, you can't get more gas until the next town. If you drive recklessly (bad sales strategy), you'll run out of gas before you get there, no matter how good your engine (ordering) was.
  • The Result: If you have a "smart" sales algorithm that knows how to ration the gas, you win big. If you have a "dumb" sales algorithm, you lose, even if your ordering was perfect.

Scenario B: The "Short Line" (Frequent Restocking)

Imagine you order flour every single morning.

  • The Finding: In this case, how you order matters most.
  • The Analogy: Think of a water fountain that refills itself every second. It doesn't matter if the person drinking from it is clumsy or smart; as long as the water keeps flowing, they won't go thirsty.
  • The Result: The authors found that if you have a "dumb" salesperson who just sells to whoever walks in, but you have a "smart" ordering system that keeps the tank full, you will actually make more money than a company with a "smart" salesperson but a "dumb" ordering system.

The Takeaway: If you restock often, fix your supply chain first. If you restock rarely, fix your sales strategy first.

3. The "Regret" Stability (The Magic of Base-Stock)

One of the paper's technical highlights is a concept called "Regret Stability."

  • The Concept: "Regret" is the money you could have made if you had a crystal ball.
  • The Finding: The authors discovered that if you use a specific type of ordering policy (called a Base-Stock Policy), your "regret" doesn't pile up over time.
  • The Analogy: Imagine a leaky bucket. If you have a smart ordering system (Base-Stock), it's like having a self-repairing bucket. Even if you make a mistake today (sell the wrong cake), the system automatically corrects itself tomorrow by ordering the right amount to get back on track. The mistakes don't compound; they stay small.

4. The "Crystal Ball" Mistake (Look-Ahead)

The paper also looked at "Myopic" algorithms. "Myopic" means "short-sighted."

  • The Mistake: A short-sighted salesperson sees a customer today and thinks, "I have cake, I'll sell it!" They don't think, "Wait, a VIP customer who pays double is coming tomorrow."
  • The Surprise: The authors found that sometimes, a short-sighted human (who just sells everything) can actually do worse than a smart computer that doesn't even know the future, simply because the computer is designed to be conservative.
  • The Solution: They invented a new "Look-Ahead" algorithm. This is like a salesperson who checks the weather forecast and the VIP schedule before selling a cake.
  • The Result: This new algorithm consistently made more money (about 1-2% more) than the old methods. In a massive business like Amazon, 1% is millions of dollars.

Summary: What Should a Manager Do?

  1. Check your restocking speed: If you restock rarely (long cycles), invest heavily in your sales software (fulfillment algorithms). If you restock often (short cycles), invest heavily in your supply chain (replenishment policies).
  2. Don't ignore the future: Even a simple "Look-Ahead" feature that checks the next few days' demand can significantly boost profits compared to just reacting to the customer in front of you.
  3. Stability is key: Use a "Base-Stock" ordering policy. It acts like a shock absorber, ensuring that small mistakes in sales don't turn into catastrophic inventory disasters.

In short: Don't just optimize the salesperson or the truck driver in isolation. The magic happens when you tune them to work together, especially knowing how often the truck arrives.