Generalized Stock Price Prediction for Multiple Stocks Combined with News Fusion

This paper proposes a generalized stock price prediction model that integrates historical prices with daily news filtered via stock name embeddings and attention-based pooling mechanisms within a Large Language Model framework, achieving a 7.11% reduction in Mean Absolute Error compared to baselines.

Pei-Jun Liao, Hung-Shin Lee, Yao-Fei Cheng, Li-Wei Chen, Hung-yi Lee, Hsin-Min Wang

Published 2026-03-23
📖 4 min read☕ Coffee break read

Imagine you are trying to predict the weather for next week. You could just look at the temperature from the last 20 days, but that's not enough. You also need to know about the wind, the humidity, and maybe even read the local news to see if a storm is brewing.

This paper is about building a super-smart stock market weather forecaster. Instead of just looking at past stock prices, the authors built a system that reads thousands of daily news articles to guess what a stock price will do tomorrow.

Here is the breakdown of their invention, explained with simple analogies:

1. The Problem: Too Much Noise

Imagine you are trying to listen to a specific friend's voice at a loud, chaotic party.

  • The Old Way: You ask a bouncer to find only the people talking about your friend. If the bouncer is bad, you miss important info. If the bouncer is too strict, you get no info.
  • The New Problem: If you just listen to everyone at the party, you get overwhelmed by noise. You can't tell if someone is talking about your friend or just about the food.
  • The Authors' Solution: They built a "smart filter" that sits right at the party. It knows exactly who your friend is and instantly tunes out everyone talking about anything else.

2. The Secret Weapon: "Stock Name" as a Magnet

The core trick in this paper is using the Stock Name (like "Apple" or "TSMC") as a magnet to pull out the right news.

  • The Setup: Every day, there are hundreds of news articles. Most are irrelevant.
  • The Magic: The system takes the name of the stock (e.g., "TSMC") and turns it into a digital "magnet."
  • The Filter: It runs this magnet through the pile of news articles.
    • Cross-Attention (The Magnet): The stock name "calls out" to the news. Only the articles that "answer back" (are relevant) get picked up.
    • Self-Attention (The Group Hug): The system looks at the stock name and all the news articles together, letting them "hug" and decide which ones fit best.
    • Position-Aware (The Timeline): It also remembers the order in which the news appeared, just in case the timing matters.

3. The Brain: A Giant Language Model (LLM)

Once the system has filtered the news, it needs a brain to make sense of it.

  • They use a Large Language Model (LLM), which is like a super-intelligent robot that has read almost everything ever written.
  • Usually, these robots are great at writing stories but bad at math. The authors had to teach this robot how to read numbers (stock prices) by wrapping the numbers in a special "suit" (called Patch Reprogramming) so the robot understands them as if they were words.

4. The "General" Student vs. The "Specialist"

  • Old Approach: Train a different student for every single stock. One student learns only about Apple, another only about Ford. This is expensive and slow.
  • This Paper's Approach: Train one single "General" student to learn about all stocks at once.
  • Why it's cool: This student learns the general rules of the market. If they understand how "Tech" stocks usually react to bad news, they can apply that logic to a new stock they've never seen before. It's like teaching a chef to cook any dish, rather than hiring a different chef for every recipe.

5. The Results: Smarter and Faster

The authors tested this on the Taiwan Stock Exchange and US stocks.

  • The Score: Their new method reduced prediction errors by about 7% compared to the old ways. In the world of finance, that's a huge win.
  • The Trade-off: It takes longer to train (2 days vs. 30 minutes for a simple model), but the accuracy is much better. It's like using a slow, high-end GPS instead of a quick, cheap one that might get you lost.

Summary Analogy

Think of the stock market as a giant library.

  • Old Models: They just look at the checkout history of books (past prices).
  • This Model: It sends a librarian (the AI) into the library. The librarian is holding a specific book title (the Stock Name). The librarian scans the whole library, grabs only the books that mention that title, summarizes them, and then uses a super-brain to guess what the next chapter (tomorrow's price) will be.

By using the stock name to filter the noise, the system stops getting distracted by irrelevant news and makes much smarter guesses.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →