SelvaBox: A high-resolution dataset for tropical tree crown detection

The paper introduces SelvaBox, the largest open-access high-resolution drone imagery dataset for tropical tree crown detection, which enables the development of robust models that achieve state-of-the-art performance through both zero-shot generalization and unified multi-resolution training.

Hugo Baudchon, Arthur Ouaknine, Martin Weiss, Mélisande Teng, Thomas R. Walla, Antoine Caron-Guay, Christopher Pal, Etienne Laliberté

Published 2026-03-02
📖 5 min read🧠 Deep dive

Imagine you are trying to count every single apple on a massive, tangled apple tree in a dense forest. The branches are thick, the leaves are everywhere, and the apples are hidden behind one another. Now, imagine doing this not just for one tree, but for an entire rainforest, where the "apples" (tree crowns) vary wildly in size, shape, and color, and they are all squished together.

This is the challenge scientists face when trying to monitor tropical forests using satellite photos or drone images. The paper "SELVABOX" introduces a solution to this problem, acting like a massive, high-definition instruction manual for computers to learn how to spot these trees.

Here is a breakdown of the paper using simple analogies:

1. The Problem: The "Needle in a Haystack" Dilemma

Tropical forests are the lungs of our planet, storing huge amounts of carbon and hosting incredible biodiversity. But they are changing fast due to climate change and human activity. To understand what's happening, we need to count and track individual trees.

  • The Old Way: Scientists used to walk through the forest with tape measures. This is slow, dangerous, and covers very little ground.
  • The Satellite Way: Satellites take pictures from space, but they are like looking at a forest from a plane at 30,000 feet. The trees look like a green carpet; you can't see individual trees.
  • The Drone Way: Drones fly low and take super-clear photos (like looking at the forest from a helicopter). But until now, there wasn't enough "training data" to teach computers how to recognize the trees in these photos. Existing datasets were mostly for temperate forests (like pine trees in Canada), which look very different from chaotic, overlapping tropical trees.

2. The Solution: SELVABOX (The "Super-Teacher")

The authors created SELVABOX, which is essentially a massive, high-resolution photo album of tropical forests from Brazil, Ecuador, and Panama.

  • The Scale: Imagine a library. Previous datasets were like a small bookshelf with a few hundred books. SELVABOX is a massive library with over 83,000 individual trees manually labeled by expert biologists. It's an order of magnitude bigger than anything else before.
  • The Quality: The photos are so clear (high resolution) that you can see details as small as a few centimeters. The biologists drew boxes around every single tree crown they could see, even the tricky ones hidden in the shadows.
  • The Variety: It includes different types of forests: ancient primary forests, regrowing secondary forests, and tree plantations. This teaches the computer that trees look different depending on where they are.

3. The Experiment: Teaching the Computer to "See"

The researchers didn't just make the dataset; they used it to train and test different AI models (computer brains) to see which one was the best at finding trees.

  • The "Zoom" Analogy: They tested if it's better to look at a small patch of forest in extreme detail (high resolution) or a large patch with less detail. They found that zooming in (higher resolution) consistently helped the computer spot more trees, especially the small or hidden ones.
  • The "Multi-Task" Analogy: They tried training the AI on just SELVABOX versus training it on SELVABOX plus other datasets from different parts of the world.
    • Result: The AI trained on SELVABOX became so smart that it could look at a tropical forest it had never seen before (in a different country) and still find the trees accurately. This is called Zero-Shot Learning—like a student who learns the rules of chess so well they can beat a grandmaster without ever having played that specific grandmaster before.

4. The Key Findings

  • Resolution Matters: Just like you need a high-definition TV to see the details of a movie, the AI needs high-resolution drone photos to distinguish individual trees in a dense jungle.
  • The "One-Size-Fits-All" Model: They created a system that can handle different photo sizes and zoom levels at the same time. This is like teaching a student to read a book whether it's printed in tiny font or huge letters.
  • Beating the Competition: Their best model (using a specific type of AI architecture called "DINO") outperformed all existing methods. It didn't just work on the data it was trained on; it generalized well to new, unseen forests.

5. Why This Matters (The "Big Picture")

Think of SELVABOX as the Rosetta Stone for tropical forest monitoring.

Before this, if you wanted to study a specific rainforest, you had to build your own training data from scratch, which takes years. Now, researchers and conservationists can use this open-source dataset and the pre-trained models to:

  • Count trees to estimate how much carbon the forest stores.
  • Detect illegal logging by spotting missing trees.
  • Monitor how forests recover after fires or storms.

The Ethical Warning

The authors are careful to note that while this technology is powerful, it's a tool, not a magic wand. It's meant to help scientists and conservationists protect forests, not to help loggers find the most valuable trees to cut down. They emphasize that these AI predictions should be used to support human decision-making, not replace it entirely.

In a Nutshell

SELVABOX is a giant, high-quality dataset that teaches computers how to spot individual trees in the world's most complex forests. By using this "training manual," AI can now act like a super-powered forest ranger, helping us monitor the health of our planet's most vital ecosystems from the sky.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →