Field imaging framework for morphological characterization of aggregates with computer vision: Algorithms and applications

This dissertation presents a comprehensive field imaging framework that leverages advanced computer vision algorithms, including 2D instance segmentation and an integrated 3D reconstruction-segmentation-completion approach, to overcome the limitations of traditional methods and enable accurate morphological characterization of construction aggregates across diverse field scenarios.

Haohang Huang

Published 2026-03-05
📖 5 min read🧠 Deep dive

Imagine you are a construction engineer. Your job is to build roads, bridges, and dams. To do this, you need aggregates—basically, rocks, gravel, and sand. These aren't just any rocks; they are the "skeleton" of our infrastructure.

For decades, checking if these rocks are the right size and shape has been a nightmare.

  • The Old Way: Engineers would stand in a dusty quarry, squinting at a pile of rocks the size of small cars. They'd guess the size, maybe use a giant ruler, or weigh them one by one with heavy machinery. It was slow, subjective (based on who was looking), and often inaccurate.
  • The Problem: You can't put a giant rock on a kitchen scale, and you can't easily measure a rock buried under a pile of other rocks.

The Solution: This dissertation introduces a "Digital Eye" for construction. It's a smart system that uses cameras, smartphones, and artificial intelligence (AI) to take pictures of rock piles and instantly tell you exactly what's inside them—down to the volume of every single rock.

Here is how the system works, broken down into three simple levels, like a video game getting harder:

Level 1: The Solo Rock (The "Individual" Mode)

Imagine you have one giant rock sitting alone.

  • The Trick: You take three photos of it from different angles (top, front, side) using a smartphone. You also snap a photo of a known object, like a basketball, to act as a ruler.
  • The Magic: The computer looks at the photos, cuts the rock out of the background (even if the sun is glaring or there are shadows), and "stitches" the three photos together to build a 3D model.
  • The Result: It calculates the rock's volume and weight with amazing accuracy. It's like turning a flat photo into a 3D hologram that you can measure.
  • Why it matters: It's much faster and more accurate than an engineer trying to guess the size with a tape measure.

Level 2: The Rock Pile (The "2D Crowd" Mode)

Now, imagine a massive pile of thousands of rocks. You can't measure them one by one.

  • The Trick: You take a photo of the whole pile.
  • The Magic: This is where the AI shines. Think of the AI as a super-organized librarian. It looks at the messy photo and says, "Okay, that's Rock #1, that's Rock #2, and that's Rock #3." It separates them automatically, even if they are touching or overlapping.
  • The Result: It counts them all, measures their shapes, and tells you the "grade" of the pile (e.g., "This pile has 60% big rocks and 40% medium rocks").
  • Why it matters: It turns a chaotic pile into a neat spreadsheet in seconds.

Level 3: The Invisible Rock (The "3D Detective" Mode)

This is the hardest part. In a real pile, rocks are stacked on top of each other. If you take a photo, you only see the top half of the rocks. The bottom half is hidden. How do you know the full size?

  • The Trick: The system uses a "Virtual Reality" lab.
    1. The Library: First, the researchers built a digital library of perfect 3D models of real rocks.
    2. The Simulator: They used a computer game engine to drop these digital rocks into a virtual pile, just like gravity works in real life. Because it's a simulation, the computer knows the exact shape of every rock, even the parts that are hidden.
    3. The Training: They taught the AI to look at the "fake" pile and guess the hidden parts. It's like showing a student a picture of a half-eaten apple and asking them to draw the whole apple.
    4. The Real World: Finally, they took this trained AI to a real quarry. It scanned a real pile, guessed the hidden parts of the rocks, and reconstructed the full 3D shape.
  • The Result: It can estimate the weight and volume of rocks that are partially buried, which was previously impossible.

The "Secret Sauce": Shape Percentage

The researchers realized that sometimes the AI guesses wrong because it can't see enough of the rock. So, they invented a "Confidence Score" called Shape Percentage (SP).

  • If the AI can only see 50% of a rock, it says, "I'm not sure about this one."
  • If it can see 80%, it says, "I'm pretty confident."
  • By filtering out the low-confidence guesses, the final results become incredibly accurate.

Why Should You Care?

This isn't just about rocks; it's about building a better world.

  • Safety: If a bridge is built with rocks that are too small or the wrong shape, it could collapse. This system ensures the rocks are perfect.
  • Money: Construction companies waste millions of dollars guessing wrong. This system saves time, fuel, and money.
  • Speed: What used to take a team of engineers days to measure can now be done in minutes with a smartphone and some software.

In a nutshell: This research took the messy, dusty, hard-to-measure world of construction rocks and gave it a "digital twin." It uses cameras to see, AI to think, and math to measure, turning a pile of chaos into a precise, data-driven blueprint for building our future.