A market resilient data-driven approach to option pricing

This paper presents a no-arbitrage-based data-driven ensemble approach for option pricing that establishes a common representation space for domain adaptation and validates its effectiveness through real-world data and experimental results.

Anindya Goswami, Nimit Rana

Published 2026-03-10
📖 5 min read🧠 Deep dive

Imagine you are trying to predict the price of a warranty for a car. If you own a Ferrari, the warranty costs more than for a Toyota, not just because the car is expensive, but because the risk of something breaking is different. In the financial world, these "warranties" are called options. They are contracts that let you buy or sell a stock at a specific price later on.

Figuring out the "fair price" for these options is a huge puzzle. For decades, mathematicians have used complex formulas (like the famous Black-Scholes model) to solve it. But these formulas assume the market behaves in a perfect, predictable way. Real life is messy, chaotic, and full of surprises (like a pandemic or a sudden crash).

This paper proposes a new way to solve the puzzle: Let the data teach us, but give it a little theoretical nudge.

Here is the breakdown of their idea using simple analogies:

1. The Old Way: The "Same Shape" Rule (Homogeneity Hint)

Imagine you have a recipe for a perfect chocolate cake. You know that if you double the ingredients, you get a cake that is exactly twice as big, but it tastes the same. This is called homogeneity.

The authors looked at a rule in finance that says: If two stocks have similar "flavor profiles" (statistical patterns of how their prices move), you can use the pricing data from one to predict the price of the other.

  • The Problem: This works great if you are comparing two similar cars (like a Honda and a Toyota). But what if you try to use the Honda data to price a Ferrari? The "flavor profiles" are too different. The model gets confused and fails. This is called a Domain Shift.

2. The New Idea: The "Universal Translator" (Common Representation Space)

The authors realized that while a Honda and a Ferrari drive differently, they both follow the laws of physics. They just need a translator to speak the same language.

They invented a "translator" called the Volatility Scalar.

  • The Analogy: Imagine you are trying to compare the speed of a snail and a cheetah. If you just look at "meters per second," the numbers are wildly different. But if you measure them in "body lengths per second," they might actually be moving at a similar relative speed.
  • The Solution: The authors created a mathematical "body length" for stocks. They take the chaotic, different price movements of two different stocks (like the NIFTY 50 index and the Bank Nifty index) and scale them down so they look the same.

Once the data is scaled, a machine learning model can learn the "universal rules" of option pricing from one stock and apply them to another, even if they are totally different.

3. The "Smart Switch" (The Ensemble Model)

The authors didn't just stop at the translator. They built a Smart Switch (an Ensemble Model) that decides which method to use based on the weather.

  • Normal Days (Typical Data): If the market is calm and behaving like usual, the model trusts the old "Same Shape" rule (Homogeneity Hint). It's simple and works well.
  • Stormy Days (Atypical Data): If the market is crashing or acting weird (like during the COVID-19 lockdown), the "Same Shape" rule breaks. The model detects this "storm" (using a metric they call the Domain Shift Quotient) and flips the switch to the "Universal Translator" (Domain Shift approach).

Think of it like driving a car:

  • On a smooth highway, you use Cruise Control (the old method).
  • When you hit a bumpy, off-road trail, you switch to 4-Wheel Drive (the new method).
  • The Ensemble Model is the driver who knows exactly when to switch gears so you never get stuck.

4. The Results: Why This Matters

The authors tested this on real Indian stock market data, including the chaotic period of the 2020 pandemic.

  • The Old Models: Got confused during the pandemic and made big errors.
  • The New Model: Adapted quickly. By using the "Universal Translator," it could learn from one part of the market and accurately predict prices in another, even when the market was behaving strangely.
  • The "Super Model": Their final "Smart Switch" model was the best of all. It was accurate on calm days and resilient during the storm.

The Big Takeaway

This paper is a bridge between Math Theory and Data Science.

  • Data Science says: "Throw all the data at a computer and let it guess."
  • Math Theory says: "Follow the strict rules of physics."

The authors say: "Let's use the strict rules to organize the data, so the computer can guess better."

They proved that by understanding why markets move (using the "Volatility Scalar"), we can build AI models that don't just memorize the past, but can actually survive the future, even when the future looks nothing like the past.