Estimating sub-frame time differences in camera image sequences

This paper presents and validates a cross-spectral technique capable of estimating sub-frame relative time delays between optical intensity signals with better than 50-microsecond accuracy, a method developed for dynamic auroral imaging but applicable to various camera timing calibration and time-varying signal measurement tasks.

Original authors: Juha Vierinen, Pavithiran Sivasothy, Björn Gustavsson

Published 2026-05-29
📖 4 min read☕ Coffee break read

Original authors: Juha Vierinen, Pavithiran Sivasothy, Björn Gustavsson

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine you are watching a movie on your phone. The movie isn't actually a continuous stream of light; it's a rapid series of still pictures (frames) shown 60 times every second. To your eyes, it looks smooth, but to a computer, the world is chopped up into tiny slices of time.

Usually, if you want to know exactly when two things happened in a video, you can only be sure they happened within the same "slice" (about 16 milliseconds apart). But what if you need to know if one event happened just a tiny fraction of a second before the other, even though they both appear in the same picture frame?

This is the problem the paper solves. The authors, researchers from the University of Tromsø, have invented a mathematical "super-sense" that lets a camera see time differences much smaller than the time it takes to snap a single photo.

The Problem: The "Flickering" Sky

The researchers were originally thinking about the Aurora Borealis (Northern Lights). Sometimes, the lights in the sky flicker or change shape very quickly. Scientists believe these changes happen because high-speed electrons rain down from space, hitting different layers of the atmosphere at slightly different times.

If you have two cameras watching the sky, or even one camera looking at two different parts of the sky, you might see a "flicker" in one spot and then a "flicker" in another spot a few milliseconds later. Standard cameras are too slow to catch this tiny gap; they just see it as one big blur. The researchers wanted a way to measure that tiny gap without needing expensive, super-fast military-grade cameras.

The Solution: The "Cross-Spectral" Ear

Instead of trying to take a picture faster, the authors used a clever trick based on sound waves and music.

Think of the changing brightness of the aurora (or a flashing light) like a song. Even if the song is playing, it has a rhythm and a beat.

  1. The Setup: They built a simple device with two LED lights. One light blinked randomly, and the other light blinked in the exact same pattern, but with a tiny, known delay (like a drummer hitting a snare drum a split-second after the hi-hat).
  2. The Recording: They filmed this with a standard smartphone camera.
  3. The Magic: They didn't look at the video frame-by-frame. Instead, they took the "song" of the brightness from the first light and the "song" of the second light and compared them mathematically. This is called a cross-spectrum.

The Analogy: Imagine two people clapping their hands. If they clap at the exact same time, their sounds match perfectly. If one person claps a tiny bit later, their sound is slightly out of sync. By listening to the pattern of the claps over a long time, you can calculate exactly how many microseconds one person is lagging behind the other, even if you can't hear the individual claps clearly.

The math works the same way with light. By analyzing the "rhythm" of the light changes across many frames, they could calculate the time difference between two points on the screen with incredible precision.

The Results: Seeing the Invisible

They tested this method and found:

  • Extreme Precision: They could measure time differences as small as 50 microseconds (that's 0.00005 seconds). To put that in perspective, a standard video frame lasts about 16,000 microseconds. They are measuring gaps that are 300 times smaller than a single frame.
  • The "Rolling Shutter" Effect: They also used this to look at the camera itself. Most smartphone cameras don't take a picture of the whole scene at once; they scan it from top to bottom (like a rolling shutter on a garage door). This means the top of the photo is taken a tiny bit earlier than the bottom. The researchers used their method to map exactly how much time passes as the camera "scans" down the screen, proving they could see the camera's own internal timing quirks.

Why This Matters (According to the Paper)

The paper claims this technique is a game-changer for:

  1. Studying the Aurora: It allows scientists to measure the tiny delays in the Northern Lights caused by electrons traveling through the atmosphere, which was previously impossible with standard video.
  2. Camera Calibration: It can be used to check if different cameras are perfectly synchronized or to measure the internal timing of a single camera's sensor.

The authors emphasize that this works with cheap, everyday equipment (like a smartphone and a simple Arduino microcontroller) and doesn't require expensive hardware. They successfully proved that by looking at the pattern of light changes rather than just the pictures themselves, we can "hear" time passing in fractions of a millisecond.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →