A Benchmark of Classical and Deep Learning Models for Agricultural Commodity Price Forecasting on A Novel Bangladeshi Market Price Dataset

This paper introduces AgriPriceBD, a novel benchmark dataset of daily Bangladeshi agricultural commodity prices, and systematically evaluates classical and deep learning forecasting models, revealing that simple persistence often outperforms complex architectures due to the data's random-walk nature and the incompatibility of certain models with small, step-function price dynamics.

Tashreef Muhammad, Tahsin Ahmed, Meherun Farzana, Md. Mahmudul Hasan, Abrar Eyasir, Md. Emon Khan, Mahafuzul Islam Shawon, Ferdous Mondol, Mahmudul Hasan, Muhammad Ibrahim

Published 2026-04-09
📖 5 min read🧠 Deep dive

Imagine you are a farmer in Bangladesh trying to decide when to sell your garlic or green chilies. If you sell too early, you might lose money; if you wait too long, the price might crash. To make the best decision, you need a crystal ball that can predict the future price of these crops.

This paper is essentially a report card for different "crystal balls" (computer models) trying to predict the prices of five common Bangladeshi crops: garlic, chickpeas, green chilies, cucumbers, and sweet pumpkins.

Here is the story of what they found, explained simply:

1. The Missing Map (The New Dataset)

Before this study, trying to predict these prices was like trying to navigate a city without a map. There was no single, clean list of daily prices for these specific crops available to researchers.

  • The Fix: The authors built a new map called AgriPriceBD. They used a smart AI assistant (an LLM) to read thousands of old, messy government PDF reports and turn them into a clean, digital list of prices from 2020 to 2025. Now, anyone can use this map to test their own prediction tools.

2. The Race: Old School vs. High-Tech

The researchers pitted seven different prediction methods against each other. Think of them as runners in a race:

  • The "Lazy" Runner (Naïve Persistence): This model assumes the price tomorrow will be exactly the same as today. It's simple and doesn't try to be smart.
  • The "Statistical" Runners (SARIMA & Prophet): These are classic tools that look for patterns, seasons, and holidays.
  • The "Deep Learning" Runners (BiLSTM, Transformers, Informer): These are fancy, modern AI models that try to learn complex patterns from data, similar to how a human brain learns.

3. The Shocking Results

🏆 The Winner: Sometimes, "Lazy" is Best

The biggest surprise was that for some crops (like garlic and chickpeas), the "Lazy" runner was actually the best or tied for the best.

  • The Analogy: Imagine trying to predict the weather in a place where the weather changes randomly every hour. No matter how complex your supercomputer is, the best guess for "what happens next" is just "it will probably be like it is right now."
  • The Lesson: For some crops, the price moves so randomly (like a drunk person walking home) that complex AI models can't find a pattern to exploit. They just get confused.

❌ The "Smooth" Tool Failed (Prophet)

The Prophet model is famous for being easy to use and great at predicting things like sales or website traffic.

  • The Failure: It failed miserably on these crops.
  • The Analogy: Prophet is like a smoothie blender. It assumes prices change smoothly, like a gentle river. But in Bangladesh's markets, prices are more like a staircase. They stay flat for a week, then suddenly jump up or down because of a policy change or a storm. Prophet tried to draw a smooth curve through these sharp stairs, resulting in a terrible prediction.

⚠️ The "Over-Engineered" Tool Broke (Informer)

The Informer is a very powerful, high-tech AI designed for massive datasets (like predicting stock markets with millions of data points).

  • The Failure: It went crazy. Instead of predicting prices, it started screaming random numbers.
  • The Analogy: Imagine giving a Formula 1 race car to a child to drive to the grocery store. The car is too powerful and complex for the small task. The Informer was trying to find deep, hidden patterns in a tiny dataset, and instead of learning, it started "hallucinating" and amplifying noise. It was too big for the job.

🧪 The "Learnable" Time Trick Didn't Help

The researchers tried a special trick called Time2Vec, which lets the AI "learn" how time works (like knowing that December is always cold).

  • The Result: It didn't help. In fact, for the most volatile crop (green chilies), it made things 146% worse.
  • The Analogy: It's like giving a student a textbook that is too advanced. Instead of helping them learn, the extra complexity just confused them. The simple, fixed way of telling time worked better than the fancy "learning" way.

4. The Green Chili Mystery

Green chilies were the hardest to predict. Their prices jump around wildly due to rain, border closures, and storage issues.

  • The Finding: Even the smartest AI couldn't predict them well. The best strategy was just to guess the price would stay the same as today.
  • The Takeaway: To predict green chilies, you don't need a better price-prediction AI; you need outside information (like rainfall data or import numbers). The price history alone isn't enough.

Summary: What Should We Do?

This paper teaches us three main things for developing economies like Bangladesh:

  1. Don't assume bigger is better: The most complex AI models (like Informer) often fail when you don't have enough data.
  2. Know your market: If prices jump like stairs (discrete steps), don't use tools designed for smooth rivers (like Prophet).
  3. Simplicity wins: Sometimes, the simplest guess (today's price = tomorrow's price) is the most accurate because the market is just too noisy to predict.

The authors have shared their data and code for free, hoping that farmers, policymakers, and other researchers can use this "map" to make better decisions and keep food prices stable for everyone.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →