VideoPulse: Neonatal heart rate and peripheral capillary oxygen saturation (SpO2) estimation from contact free video

The paper introduces VideoPulse, a comprehensive dataset and end-to-end deep learning pipeline that enables accurate, contact-free estimation of neonatal heart rate and SpO2 from facial video, offering a low-cost, non-invasive alternative to traditional adhesive monitoring methods in intensive care settings.

Deependra Dewagiri, Kamesh Anuradha, Pabadhi Liyanage, Helitha Kulatunga, Pamuditha Somarathne, Udaya S. K. P. Miriya Thanthrige, Nishani Lucas, Anusha Withana, Joshua P. Kulasingham

Published 2026-03-02
📖 5 min read🧠 Deep dive

Imagine a tiny newborn baby in a hospital nursery. They are fragile, their skin is like wet tissue paper, and they are constantly squirming, kicking, and turning their heads. Right now, doctors have to stick adhesive sensors and probes onto this baby's skin to check their heart rate and oxygen levels. It's uncomfortable for the baby, can cause skin irritation, and increases the risk of infection.

VideoPulse is a new technology that says, "Let's just use a camera."

Think of it like a super-powered digital stethoscope that doesn't need to touch the baby at all. Instead of listening to the heart, it "sees" the heartbeat.

How Does It Work? (The Magic of "Invisible Colors")

You know how your face turns slightly pink when you get excited or exercise? That's because blood is rushing to your skin. Even when you are sleeping, your heart pumps blood in a rhythmic wave. This causes tiny, almost invisible changes in the color of your skin with every beat.

  • The Old Way: Doctors used to try to find these color changes using complex math formulas (like trying to hear a whisper in a hurricane). It often failed if the baby moved or if the lights flickered.
  • The VideoPulse Way: This system uses Deep Learning, which is like teaching a computer to be a master detective. It watches a video of the baby's face and learns to spot those microscopic color shifts, ignoring the baby's wiggles and the changing room lights.

The Three Big Problems They Solved

The researchers faced three huge hurdles, and they built clever tools to overcome them:

1. The "Wobbly Baby" Problem (Face Alignment)

  • The Issue: Babies don't sit still like adults. They roll, tilt, and turn their heads. If you try to take a photo of a baby who is upside down or sideways, a normal camera app might get confused.
  • The Solution: The team built a digital "head-turner." Before the computer tries to read the heartbeat, it automatically rotates the video in its mind (like turning a photo album) until the baby's face is straight. It's like having a nurse gently hold the baby's head steady, but done entirely by software.

2. The "Noisy Signal" Problem (Cleaning the Data)

  • The Issue: To teach the computer, they needed to compare the video to a real heart monitor. But because babies move so much, the real monitor's data was often "noisy" (full of static, like a bad radio station).
  • The Solution: They used a digital "noise-canceling" filter (based on a technology called GANs). Imagine you have a recording of a baby crying, but there's a loud fan in the background. This tool learns to subtract the fan noise and leave only the baby's cry. They cleaned up the "real" data so the computer could learn the right patterns.

3. The "Rare Numbers" Problem (Oxygen Levels)

  • The Issue: Most healthy babies have oxygen levels near 100%. Very few have low levels. If you teach a student only with examples of "100," they will be terrible at guessing "85" when it actually happens.
  • The Solution: They used a technique called Label Distribution Smoothing. Think of it like a teacher who doesn't just grade the student on the most common answers, but gives extra credit for paying attention to the rare, difficult questions. This ensures the system is ready to detect dangerous low-oxygen levels, not just the safe ones.

The New "Training Manual" (VideoPulse Dataset)

To make this work, the researchers couldn't just use old data from adults or babies from other countries. Babies in Sri Lanka (where this study happened) have different skin tones and facial features than babies in the datasets used before.

So, they created a new library of videos called VideoPulse.

  • They filmed 52 real newborns in a Sri Lankan hospital.
  • They recorded 2.6 hours of video, capturing babies in all sorts of positions.
  • This is like creating a new, diverse textbook for the computer to study, ensuring it works for all babies, not just a specific type.

The Results: Fast and Accurate

The results are impressive:

  • Speed: The system can predict the heart rate in just 2 seconds. That's almost instant! (Older methods needed 6 seconds or more).
  • Accuracy: It's accurate enough to meet strict medical standards. The error rate is so small it's almost like guessing the time within a few seconds.
  • Oxygen: This is the big breakthrough. While other systems could guess heart rates, VideoPulse is one of the first to accurately guess oxygen levels (SpO2) just by looking at a video.

Why Does This Matter?

Imagine a future where a baby in the NICU (Neonatal Intensive Care Unit) is monitored by a simple webcam on the ceiling.

  • No more sticky tapes peeling off fragile skin.
  • No more wires tangling the baby.
  • 24/7 monitoring that doesn't disturb the baby's sleep.

VideoPulse turns a standard camera into a life-saving medical tool, making hospital care gentler, safer, and more comfortable for the tiniest patients. It's a perfect example of how technology can be used to give a baby a little more peace and a lot more comfort.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →