Contactless Blood Oxygen Saturation Estimation from Facial Videos Using Deep Learning

被引:4
|
作者
Cheng, Chun-Hong [1 ]
Yuen, Zhikun [2 ]
Chen, Shutao [3 ]
Wong, Kwan-Long [3 ]
Chin, Jing-Wei [3 ]
Chan, Tsz-Tai [3 ]
So, Richard H. Y. [3 ,4 ]
机构
[1] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England
[2] Univ Ottawa, Dept Biomol Sci, Ottawa, ON K1H 8M5, Canada
[3] Hong Kong Sci & Technol Pk, Hong Kong, Peoples R China
[4] Hong Kong Univ Sci & Technol, Dept Ind Engn & Decis Analyt, Kowloon, Clear Water Bay, Hong Kong, Peoples R China
来源
BIOENGINEERING-BASEL | 2024年 / 11卷 / 03期
关键词
blood oxygen saturation measurement; deep learning; facial videos; non-contact monitoring; remote health monitoring;
D O I
10.3390/bioengineering11030251
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Blood oxygen saturation (SpO2) is an essential physiological parameter for evaluating a person's health. While conventional SpO2 measurement devices like pulse oximeters require skin contact, advanced computer vision technology can enable remote SpO2 monitoring through a regular camera without skin contact. In this paper, we propose novel deep learning models to measure SpO2 remotely from facial videos and evaluate them using a public benchmark database, VIPL-HR. We utilize a spatial-temporal representation to encode SpO2 information recorded by conventional RGB cameras and directly pass it into selected convolutional neural networks to predict SpO2. The best deep learning model achieves 1.274% in mean absolute error and 1.71% in root mean squared error, which exceed the international standard of 4% for an approved pulse oximeter. Our results significantly outperform the conventional analytical Ratio of Ratios model for contactless SpO2 measurement. Results of sensitivity analyses of the influence of spatial-temporal representation color spaces, subject scenarios, acquisition devices, and SpO2 ranges on the model performance are reported with explainability analyses to provide more insights for this emerging research field.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Violence Detection in Videos Using Deep Learning: A Survey
    Kaur, Gurmeet
    Singh, Sarbjeet
    ADVANCES IN INFORMATION COMMUNICATION TECHNOLOGY AND COMPUTING, AICTC 2021, 2022, 392 : 165 - 173
  • [32] ROSE-Net: Leveraging remote photoplethysmography to estimate oxygen saturation using deep learning
    Chowdhury, Moajjem Hossain
    Reaz, Mamun Bin Ibne
    Ali, Sawal Hamid Md
    Khan, Muhammad Salman
    Chowdhury, Muhammad E. H.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 100
  • [33] Radar-Based Contactless Blood Pressure Estimation System Using Signal Decomposition and Deep Neural Network
    Wang, Yong
    Wang, Sibo
    Fang, Chao
    Zhou, Mu
    Yang, Xiaolong
    Zhang, Qian
    Pang, Yu
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [34] Automatic stress analysis from facial videos based on deep facial action units recognition
    Giannakakis, Giorgos
    Koujan, Mohammad Rami
    Roussos, Anastasios
    Marias, Kostas
    PATTERN ANALYSIS AND APPLICATIONS, 2022, 25 (03) : 521 - 535
  • [35] Automatic stress analysis from facial videos based on deep facial action units recognition
    Giorgos Giannakakis
    Mohammad Rami Koujan
    Anastasios Roussos
    Kostas Marias
    Pattern Analysis and Applications, 2022, 25 : 521 - 535
  • [36] Emotion Recognition from Facial Expression using Explainable Deep Learning
    Cesarelli, Mario
    Martinelli, Fabio
    Mercaldo, Francesco
    Santone, Antonella
    2022 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2022, : 306 - 311
  • [37] Deep Learning Model for Blood Pressure Estimation from PPG Signal
    Kim, Minseong
    Lee, Hyeonjeong
    Kim, Kwang-Yong
    Kim, Kyu-Hyung
    2022 IEEE INTERNATIONAL CONFERENCE ON METROLOGY FOR EXTENDED REALITY, ARTIFICIAL INTELLIGENCE AND NEURAL ENGINEERING (METROXRAINE), 2022, : 1 - 5
  • [38] Deep domain-invariant learning for facial age estimation
    Bao, Zenghao
    Luo, Yutian
    Tan, Zichang
    Wan, Jun
    Ma, Xibo
    Lei, Zhen
    NEUROCOMPUTING, 2023, 534 : 86 - 93
  • [39] A novel comparative deep learning framework for facial age estimation
    Abousaleh, Fatma S.
    Lim, Tekoing
    Cheng, Wen-Huang
    Yu, Neng-Hao
    Hossain, M. Anwar
    Alhamid, Mohammed F.
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2016,
  • [40] Facial Attractiveness Classification using Deep Learning
    Anderson, Ricky
    Gema, Aryo Pradipta
    Suharjito
    Isa, Sani M.
    2018 INDONESIAN ASSOCIATION FOR PATTERN RECOGNITION INTERNATIONAL CONFERENCE (INAPR), 2018, : 34 - 38