Contactless blood oxygen estimation from face videos: A multi-model fusion method based on deep learning

被引:10
|
作者
Hu, Min [1 ]
Wu, Xia [1 ]
Wang, Xiaohua [1 ]
Xing, Yan [2 ]
An, Ning [1 ,3 ]
Shi, Piao [1 ]
机构
[1] Hefei Univ Technol, Key Lab Knowledge Engn Big Data, Anhui Prov Key Lab Affect Comp & Adv Intelligent M, Minist Educ, Hefei 230601, Anhui, Peoples R China
[2] Hefei Univ Technol, Sch Math, Hefei 230601, Anhui, Peoples R China
[3] Hefei Univ Technol, Natl Smart Eldercare Int S&T Cooperat Base, Hefei 230601, Anhui, Peoples R China
关键词
Estimation; Remote photo-plethysmography; Deep learning; Residual network; Coordinate attention; Multi-model fusion; PULSE; NONCONTACT; SIGNAL;
D O I
10.1016/j.bspc.2022.104487
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Blood Oxygen (SpO2), a key indicator of respiratory function, has received increasing attention during the COVID-19 pandemic. Clinical results show that patients with COVID-19 likely have distinct lower SpO2 before the onset of significant symptoms. Aiming at the shortcomings of current methods for monitoring SpO2 by face videos, this paper proposes a novel multi-model fusion method based on deep learning for SpO2 estimation. The method includes the feature extraction network named Residuals and Coordinate Attention (RCA) and the multimodel fusion SpO2 estimation module. The RCA network uses the residual block cascade and coordinate attention mechanism to focus on the correlation between feature channels and the location information of feature space. The multi-model fusion module includes the Color Channel Model (CCM) and the Network-Based Model(NBM). To fully use the color feature information in face videos, an image generator is constructed in the CCM to calculate SpO2 by reconstructing the red and blue channel signals. Besides, to reduce the disturbance of other physiological signals, a novel two-part loss function is designed in the NBM. Given the complementarity of the features and models that CCM and NBM focus on, a Multi-Model Fusion Model(MMFM) is constructed. The experimental results on the PURE and VIPL-HR datasets show that three models meet the clinical requirement (the mean absolute error <= 2%) and demonstrate that the multi-model fusion can fully exploit the SpO2 features of face videos and improve the SpO2 estimation performance. Our research achievements will facilitate applications in remote medicine and home health.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Contactless Blood Oxygen Saturation Estimation from Facial Videos Using Deep Learning
    Cheng, Chun-Hong
    Yuen, Zhikun
    Chen, Shutao
    Wong, Kwan-Long
    Chin, Jing-Wei
    Chan, Tsz-Tai
    So, Richard H. Y.
    BIOENGINEERING-BASEL, 2024, 11 (03):
  • [2] A hybrid deep learning technique based integrated multi-model data fusion for forensic investigation
    Senthil, P.
    Selvakumar, S.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (05) : 6849 - 6862
  • [3] A hybrid deep learning technique based integrated multi-model data fusion for forensic investigation
    Senthil P.
    Selvakumar S.
    Journal of Intelligent and Fuzzy Systems, 2022, 43 (05) : 6849 - 6862
  • [4] Pathogenic virus detection method based on multi-model fusion
    Zhao, Xiaoyong
    Wang, Jingwei
    PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON COMPUTER, INFORMATION AND TELECOMMUNICATION SYSTEMS (CITS), 2020, : 89 - 92
  • [5] Heart Rate and Oxygen Level Estimation from Facial Videos Using a Hybrid Deep Learning Model
    Zheng, Yufeng
    MULTIMODAL IMAGE EXPLOITATION AND LEARNING 2024, 2024, 13033
  • [6] Conventional and deep learning methods in heart rate estimation from RGB face videos
    Helwan, Abdulkader
    Azar, Danielle
    Ma'aitah, Mohamad Khaleel Sallam
    PHYSIOLOGICAL MEASUREMENT, 2024, 45 (02)
  • [7] A deep learning-based multi-model ensemble method for cancer prediction
    Xiao, Yawen
    Wu, Jun
    Lin, Zongli
    Zhao, Xiaodong
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2018, 153 : 1 - 9
  • [8] A multi-model data-fusion based deep transfer learning for improved remaining useful life estimation for IIOT based systems
    Behera, Sourajit
    Misra, Rajiv
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119
  • [9] MULTI-MODEL FUSION PHOTOVOLTAIC POWER GENERATION PREDICTION METHOD BASED ON REINFORCEMENT LEARNING
    Wang J.
    Fu J.
    Chen B.
    Taiyangneng Xuebao/Acta Energiae Solaris Sinica, 2024, 45 (06): : 382 - 388
  • [10] A deep learning-based multi-model ensemble method for eye state recognition from EEG
    Islalm, Md Shafiqul
    Rahman, Md Moklesur
    Rahman, Md Hafizur
    Hoque, Md Robiul
    Roonizi, Arman Kheirati
    Aktaruzzaman, Md
    2021 IEEE 11TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2021, : 819 - 824