Contactless Blood Oxygen Saturation Estimation from Facial Videos Using Deep Learning

Cited by: 4

Authors:

Cheng, Chun-Hong [1]

Yuen, Zhikun [2]

Chen, Shutao [3]

Wong, Kwan-Long [3]

Chin, Jing-Wei [3]

Chan, Tsz-Tai [3]

So, Richard H. Y. [3,4]

Affiliations:

[1] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England

[2] Univ Ottawa, Dept Biomol Sci, Ottawa, ON K1H 8M5, Canada

[3] Hong Kong Sci & Technol Pk, Hong Kong, Peoples R China

[4] Hong Kong Univ Sci & Technol, Dept Ind Engn & Decis Analyt, Kowloon, Clear Water Bay, Hong Kong, Peoples R China

Source:

BIOENGINEERING-BASEL | 2024 / Volume 11 / Issue 03

Keywords:

blood oxygen saturation measurement; deep learning; facial videos; non-contact monitoring; remote health monitoring;

DOI:

10.3390/bioengineering11030251

Chinese Library Classification:

Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];

Discipline classification codes:

071005 ; 0836 ; 090102 ; 100705 ;

Abstract:

Blood oxygen saturation (SpO2) is an essential physiological parameter for evaluating a person's health. While conventional SpO2 measurement devices like pulse oximeters require skin contact, advanced computer vision technology can enable remote SpO2 monitoring through a regular camera without skin contact. In this paper, we propose novel deep learning models to measure SpO2 remotely from facial videos and evaluate them using a public benchmark database, VIPL-HR. We utilize a spatial-temporal representation to encode SpO2 information recorded by conventional RGB cameras and directly pass it into selected convolutional neural networks to predict SpO2. The best deep learning model achieves a mean absolute error of 1.274% and a root mean squared error of 1.71%, both within the 4% error limit that the international standard sets for an approved pulse oximeter. Our results significantly outperform the conventional analytical Ratio of Ratios model for contactless SpO2 measurement. We also report sensitivity analyses of how spatial-temporal representation color spaces, subject scenarios, acquisition devices, and SpO2 ranges influence model performance, together with explainability analyses, to provide more insight into this emerging research field.
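The error metrics and the Ratio of Ratios baseline named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names, the choice of red and blue channels, the use of standard deviation and mean as AC and DC estimates, and the calibration constants `a` and `b` are all assumptions made for illustration only.

```python
import math
from statistics import fmean, pstdev


def mae(pred, truth):
    """Mean absolute error between predicted and reference SpO2 values."""
    return fmean(abs(p - t) for p, t in zip(pred, truth))


def rmse(pred, truth):
    """Root mean squared error between predicted and reference SpO2 values."""
    return math.sqrt(fmean((p - t) ** 2 for p, t in zip(pred, truth)))


def ratio_of_ratios(red, blue):
    """Ratio of the AC/DC ratios of two color-channel traces.

    Assumption: AC is approximated by the population standard deviation
    and DC by the mean of each trace over the analysis window.
    """
    return (pstdev(red) / fmean(red)) / (pstdev(blue) / fmean(blue))


def spo2_from_ratio(r, a=100.0, b=5.0):
    """Linear calibration SpO2 = a - b * r.

    The constants a and b are hypothetical placeholders; in practice they
    are fitted against a reference pulse oximeter.
    """
    return a - b * r


if __name__ == "__main__":
    # Toy per-frame mean intensities of a facial region (made-up numbers).
    red = [100, 102, 98, 100]
    blue = [50, 51, 49, 50]
    r = ratio_of_ratios(red, blue)
    print("ratio of ratios:", r)
    print("estimated SpO2:", spo2_from_ratio(r))
    print("MAE:", mae([97, 95], [98, 97]))
    print("RMSE:", rmse([97, 95], [98, 97]))
```

The deep learning models described in the abstract replace the fixed linear calibration with a learned mapping from the spatial-temporal representation, while MAE and RMSE are still computed against contact-based reference readings as in the sketch.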


Pages: 16

