An Open-Source Training Dataset for Foundation Models for Black-box Optimization

This paper introduces BBO-Pile, the first large-scale open-source dataset containing over 500,000 optimization trajectories across 3,095 black-box functions, and demonstrates that foundation models trained on this data can effectively learn and imitate black-box optimization strategies through scalable pre-training.

Original authors: Aaron Klein, Herilalaina Rakotoarison, Luca Thale-Bombien, David Salinas

Published 2026-05-25✓ Author reviewed
📖 5 min read🧠 Deep dive

Original authors: Aaron Klein, Herilalaina Rakotoarison, Luca Thale-Bombien, David Salinas

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

The Big Problem: The "Black Box" Mystery

Imagine you are trying to bake the perfect cake, but you have a magical oven that is completely sealed. You can't see inside, you don't know the recipe, and you can't measure the temperature. The only way to learn is to put a cake in, wait for it to bake, take it out, and taste it.

  • The Cake: This is the "objective function" (the problem you want to solve).
  • The Ingredients: These are the "hyperparameters" (settings like learning rate, number of layers, etc.).
  • The Taste: This is the "score" (how good the result is).

This is called Black-Box Optimization. It happens everywhere: tuning AI models, designing new drugs, or configuring robots. The problem is that finding the perfect "cake" usually requires a human expert to guess, tweak, and taste thousands of times. It's slow, expensive, and the expert's tricks often don't work if you switch from baking a cake to baking bread.

The Old Way vs. The New Idea

The Old Way: Scientists have built many different "tasting experts" (algorithms) over the years. One expert is great at finding cake recipes, but terrible at finding bread recipes. They are specialized tools.

The New Idea (Foundation Models): What if we could train a single, super-smart AI to learn the general principles of baking? Instead of being a cake expert or a bread expert, it would be a "Master Baker" that understands how to optimize any recipe just by looking at thousands of past baking attempts.

The Missing Ingredient: A Giant Cookbook

To train this "Master Baker," you need a massive library of past baking attempts (data).

  • The Problem: Previous attempts to do this relied on secret data (which no one else could see) or made-up data (which didn't reflect real life). It was like trying to teach a chef using a cookbook written in a language no one speaks, or using fake ingredients.
  • The Solution (BBO-Pile): The authors created BBO-Pile, the first open-source "Cookbook" for this task.
    • It contains 557,100 different baking attempts (trajectories).
    • These attempts cover 3,095 different types of problems (from tuning AI models to chemical design).
    • It includes data from 6 different "tasting experts" (algorithms) so the AI can learn different strategies.
    • It is massive: about 2.5 billion words (tokens) of data.

How They Trained the "Master Baker"

The authors didn't just give the AI the cookbook; they trained a family of AI models (like different-sized chefs) to read it.

  • The Models: They built models ranging from small (2 million parameters) to large (80 million parameters).
  • The Training: They fed the models the data and asked them to predict the next step in a baking process.
    • Input: "Here is the recipe so far, and here is how the last cake tasted."
    • Output: "Here is the next ingredient mix you should try."
  • The Result: The AI learned to mimic the behavior of the original human experts. If you told the AI to act like "Expert A," it acted like Expert A. If you told it to act like "Expert B," it switched strategies.

What They Discovered

  1. Bigger is Better (but with limits): As they made the AI models bigger and fed them more data, the models got better at mimicking the experts. However, the improvement wasn't as explosive as it is with chatbots (LLMs); it was a steady, predictable climb.
  2. Generalization: The AI didn't just memorize the recipes in the book. When they tested it on a new type of problem it had never seen before (like a completely new type of bread), it still performed surprisingly well. It had learned the logic of optimization, not just the specific answers.
  3. Speed: Once trained, the AI can suggest the next step almost instantly, much faster than running complex mathematical simulations from scratch.

The Bottom Line

This paper is like building the first public library of "optimization stories." By sharing this massive dataset (BBO-Pile), the authors have allowed other researchers to train their own "Master Baker" AI.

They proved that you can train a general-purpose AI to understand how to solve complex, unknown problems by simply showing it how other methods solved similar problems in the past. It's a step toward an AI that doesn't just solve one puzzle, but knows how to figure out any puzzle.

Important Note: The paper focuses entirely on creating this dataset and training these models to mimic existing optimization methods. It does not claim to have solved specific real-world problems (like curing a disease or designing a specific rocket) yet, nor does it discuss future clinical applications. The goal was simply to prove that this "Foundation Model" approach works and to provide the data so others can try it.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →