Contactless Blood Oxygen Saturation Estimation from Facial Videos Using Deep Learning

Cited by: 4
Authors
Cheng, Chun-Hong [1]
Yuen, Zhikun [2]
Chen, Shutao [3]
Wong, Kwan-Long [3]
Chin, Jing-Wei [3]
Chan, Tsz-Tai [3]
So, Richard H. Y. [3, 4]
Affiliations
[1] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England
[2] Univ Ottawa, Dept Biomol Sci, Ottawa, ON K1H 8M5, Canada
[3] Hong Kong Sci & Technol Pk, Hong Kong, Peoples R China
[4] Hong Kong Univ Sci & Technol, Dept Ind Engn & Decis Analyt, Kowloon, Clear Water Bay, Hong Kong, Peoples R China
Source
BIOENGINEERING-BASEL | 2024 / Vol. 11 / Issue 03
Keywords
blood oxygen saturation measurement; deep learning; facial videos; non-contact monitoring; remote health monitoring
DOI
10.3390/bioengineering11030251
Chinese Library Classification number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Subject classification codes
071005; 0836; 090102; 100705
Abstract
Blood oxygen saturation (SpO2) is an essential physiological parameter for evaluating a person's health. Conventional SpO2 measurement devices such as pulse oximeters require skin contact, whereas computer vision technology can enable remote SpO2 monitoring through a regular camera without skin contact. In this paper, we propose novel deep learning models to measure SpO2 remotely from facial videos and evaluate them on a public benchmark database, VIPL-HR. We use a spatial-temporal representation to encode the SpO2 information recorded by conventional RGB cameras and pass it directly into selected convolutional neural networks to predict SpO2. The best deep learning model achieves a mean absolute error of 1.274% and a root mean squared error of 1.71%, both of which are better than the 4% error limit that the international standard sets for an approved pulse oximeter. Our results significantly outperform the conventional analytical Ratio of Ratios model for contactless SpO2 measurement. We also report sensitivity analyses of how spatial-temporal representation color spaces, subject scenarios, acquisition devices, and SpO2 ranges influence model performance, together with explainability analyses, to provide more insight for this emerging research field.
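For orientation, the Ratio of Ratios baseline and the two error metrics named in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes the red and blue channel traces are used, takes the AC component as the standard deviation and the DC component as the mean of each trace, and uses placeholder calibration constants `a` and `b` that are not taken from the paper. All function names are hypothetical.

```python
import math
from statistics import fmean, pstdev


def ratio_of_ratios(red, blue):
    """Return R = (AC_red / DC_red) / (AC_blue / DC_blue).

    Assumption: AC is approximated by the standard deviation and DC by the
    mean of each color-channel trace averaged over a facial region.
    """
    return (pstdev(red) / fmean(red)) / (pstdev(blue) / fmean(blue))


def spo2_from_ratio(r, a=100.0, b=5.0):
    """Linear calibration SpO2 = a - b * R.

    a and b are placeholder constants (assumed for illustration); in
    practice they are fitted against a reference pulse oximeter.
    """
    return a - b * r


def mae(pred, ref):
    """Mean absolute error between predicted and reference SpO2 values."""
    return fmean(abs(p - t) for p, t in zip(pred, ref))


def rmse(pred, ref):
    """Root mean squared error between predicted and reference SpO2 values."""
    return math.sqrt(fmean((p - t) ** 2 for p, t in zip(pred, ref)))


if __name__ == "__main__":
    # Toy channel traces, invented purely to exercise the functions.
    red = [100, 102, 100, 98]
    blue = [50, 52, 50, 48]
    r = ratio_of_ratios(red, blue)
    print(r, spo2_from_ratio(r))
    print(mae([98, 96], [97, 98]), rmse([98, 96], [97, 98]))
```

A deep learning model of the kind described in the abstract replaces the hand-calibrated linear mapping with a network trained on the spatial-temporal representation, but it can be scored with the same two metrics.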
Pages: 16