Contactless Blood Oxygen Saturation Estimation from Facial Videos Using Deep Learning

Chun Hong Cheng*, Zhikun Yuen, Shutao Chen, Kwan Long Wong, Jing Wei Chin, Tsz Tai Chan, Richard H.Y. So

*Corresponding author for this work

Research output: Contribution to journalJournal Articlepeer-review

16 Citations (Scopus)

Abstract

Blood oxygen saturation (SpO2) is an essential physiological parameter for evaluating a person’s health. While conventional SpO2 measurement devices like pulse oximeters require skin contact, advanced computer vision technology can enable remote SpO2 monitoring through a regular camera without skin contact. In this paper, we propose novel deep learning models to measure SpO2 remotely from facial videos and evaluate them using a public benchmark database, VIPL-HR. We utilize a spatial–temporal representation to encode SpO2 information recorded by conventional RGB cameras and directly pass it into selected convolutional neural networks to predict SpO2. The best deep learning model achieves 1.274% in mean absolute error and 1.71% in root mean squared error, which exceed the international standard of 4% for an approved pulse oximeter. Our results significantly outperform the conventional analytical Ratio of Ratios model for contactless SpO2 measurement. Results of sensitivity analyses of the influence of spatial–temporal representation color spaces, subject scenarios, acquisition devices, and SpO2 ranges on the model performance are reported with explainability analyses to provide more insights for this emerging research field.

Original languageEnglish
Article number251
JournalBioengineering
Volume11
Issue number3
DOIs
Publication statusPublished - Mar 2024

Bibliographical note

Publisher Copyright:
© 2024 by the authors.

Keywords

  • blood oxygen saturation measurement
  • deep learning
  • facial videos
  • non-contact monitoring
  • remote health monitoring

Fingerprint

Dive into the research topics of 'Contactless Blood Oxygen Saturation Estimation from Facial Videos Using Deep Learning'. Together they form a unique fingerprint.

Cite this