Bitcoin Price Prediction using Machine Learning and Combinatorial Fusion Analysis

This paper proposes a Bitcoin price prediction model using Combinatorial Fusion Analysis (CFA) to integrate diverse machine learning models via rank-score characteristics and weighted combinations, achieving a superior Mean Absolute Percentage Error (MAPE) of 0.19% that outperforms individual models and existing prediction methods.

Yuanhong Wu, Wei Ye, Jingyan Xu, D. Frank Hsu

Published Tue, 10 Ma
📖 4 min read☕ Coffee break read

Imagine you are trying to guess the price of Bitcoin tomorrow. It's like trying to predict the weather, but instead of rain or sunshine, the "weather" is a wildly unpredictable digital currency that can swing up or down by thousands of dollars in a single day.

This paper is about a team of researchers who decided to stop relying on just one "weather forecaster" and instead built a super-team of forecasters to get the best possible prediction.

Here is the story of how they did it, broken down into simple steps:

1. The Problem: One Expert Isn't Enough

In the past, people tried to predict Bitcoin prices using single computer programs (Machine Learning models). Think of these models as individual experts:

  • The Statistician: Good at spotting patterns in numbers.
  • The Neural Network: Good at remembering long-term history.
  • The Tree-Planter: Good at making decisions based on "if this, then that" rules.

The problem is that every expert has blind spots. Sometimes the Statistician is right, but the Neural Network is wrong. Sometimes the Tree-Planter is confused by a sudden market crash. Relying on just one is risky.

2. The Solution: The "Super-Team" (Combinatorial Fusion Analysis)

The researchers used a method called Combinatorial Fusion Analysis (CFA). Imagine you are a judge on a talent show. Instead of listening to just one judge, you have five.

  • Judge A gives a score of 8/10.
  • Judge B gives a score of 9/10.
  • Judge C thinks it's a 6/10.

Instead of just averaging their scores, CFA looks at how different their opinions are. If all five judges agree, it's a safe bet. But if they disagree, CFA uses a special math trick to figure out which judge is usually the most reliable in that specific situation and combines their votes in a way that cancels out the mistakes.

3. The Ingredients: What Did They Feed the Team?

To make the predictions, they didn't just look at Bitcoin's past price. They fed the team a massive "smoothie" of data, including:

  • The Crypto Cousins: The price of Ethereum (ETH) and Gold.
  • The Miners' Power: How much computing power is being used to secure Bitcoin (Hashrate).
  • The Global Mood: The S&P 500 (stock market), the VIX (fear index), and even the price of Tesla and Nvidia (because Elon Musk and computer chips affect crypto).
  • The "Secret Sauce": They also added technical math tools that look at trends, like moving averages.

4. The Secret Sauce: Not Just a Number, But a "Cloud"

Most prediction models try to guess a single number (e.g., "Bitcoin will be $95,000"). The researchers thought, "That's too rigid."

Instead, they asked each of their 5 models to draw a cloud of possibilities.

  • Model A says: "It's probably around $95k, but could be between $90k and $100k."
  • Model B says: "It's probably $94k, but could be between $92k and $96k."

They then used their "Super-Team" math to merge these clouds. Where the clouds overlap the most is where the true price is likely to be. This is like looking at a group of people throwing darts at a board; the spot where the most darts cluster is your best guess.

5. The Result: A Crystal Ball That Actually Works

They tested this method on data from 2020 to 2024. The results were impressive:

  • The Error Rate: Their method was off by only 0.19% on average.
  • The Comparison: Other famous methods in the past had error rates of 0.39%, 1.33%, or even 4.49%.

The Analogy:
If you were betting on the price of Bitcoin, using an old method is like asking a friend to guess the temperature. They might say "It's 70 degrees," but they could be off by 10 degrees.
Using this new method is like asking a team of meteorologists, checking their radar, their satellite data, and their historical records, and then combining their reports to say, "It's 70 degrees, and we are 99% sure it's between 69.5 and 70.5."

Why Does This Matter?

The paper proves that diversity is strength. By combining models that think differently (some are good at short-term trends, some at long-term history) and using a smart way to mix their opinions, you get a prediction that is much more robust and accurate than any single model could ever be alone.

In short: They didn't just build a better crystal ball; they built a crystal ball that listens to five different voices and knows how to ignore the noise to find the truth.