Heart Rate Estimation From Remote Photoplethysmography Based on Light-Weight U-Net and Attention Modules

被引:8
作者
Yu, Sung-Nien [1 ,2 ]
Wang, Chien-Shun [1 ]
Chang, Yu Ping [1 ]
机构
[1] Natl Chung Cheng Univ, Dept Elect Engn, Chiayi 621301, Taiwan
[2] Natl Chung Cheng Univ, Ctr Innovat Res Aging Soc CIRAS, Chiayi, Taiwan
关键词
Heart rate variability; Heart rate; Spatiotemporal phenomena; Deep learning; Face recognition; Photoplethysmography; Image color analysis; Attention; remote photoplethysmography; remote heart rate estimation; spatio-temporal map;
D O I
10.1109/ACCESS.2023.3281898
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cardiac signals are frequently used in disease and emotion analyses. However, current measurement methods mostly require direct contact. Remote photoplethysmography (rPPG) has been proposed in recent years which measures minute variations in color on the face due to blood volume changes as the heart pumps, using a consumer grade camera. In this study, we proposed a deep learning framework based on a light-weight and task-adapted version of U-Net to extract rPPG. The face video was converted into multiscale spatio-temporal map (MSTmap) as input to the network. Two types of attention mechanisms were added, namely variations of the squeeze-and-excitation block (SE block), which compresses global information to enhance the channel and ROI signals, and the multihead attention block with position encoding, which extracts information from different parts of the signal. We further propose using virtual PPG (vPPG) as a replacement for PPG ground-truth so that the model focuses on learning the peak information instead of morphological details. Extensive experiments were conducted using the UBFC-rPPG dataset for heart rate (HR) and heart rate variability (HRV) estimations. The model achieved a root-mean-square error of 0.78 bpm and correlation coefficient of 0.99 in heart rate estimation, which is comparable to state-of-the-art while being more light-weight.
引用
收藏
页码:54058 / 54069
页数:12
相关论文
共 28 条
[1]   Heart rate variability: Origins, methods, and interpretive caveats [J].
Berntson, GG ;
Bigger, JT ;
Eckberg, DL ;
Grossman, P ;
Kaufmann, PG ;
Malik, M ;
Nagaraja, HN ;
Porges, SW ;
Saul, JP ;
Stone, PH ;
VanderMolen, MW .
PSYCHOPHYSIOLOGY, 1997, 34 (06) :623-648
[2]   Unsupervised skin tissue segmentation for remote photoplethysmography [J].
Bobbia, Serge ;
Macwan, Richard ;
Benezeth, Yannick ;
Mansouri, Alamin ;
Dubois, Julien .
PATTERN RECOGNITION LETTERS, 2019, 124 :82-90
[3]   DeepPhys: Video-Based Physiological Measurement Using Convolutional Attention Networks [J].
Chen, Weixuan ;
McDuff, Daniel .
COMPUTER VISION - ECCV 2018, PT II, 2018, 11206 :356-373
[4]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[5]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[6]   Robust Pulse Rate From Chrominance-Based rPPG [J].
de Haan, Gerard ;
Jeanne, Vincent .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2013, 60 (10) :2878-2886
[7]  
King DE, 2015, Arxiv, DOI arXiv:1502.00046
[8]   Optimal color channel combination across skin tones for remote heart rate measurement in camera-based photoplethysmography [J].
Ernst, Hannes ;
Scherpf, Matthieu ;
Malberg, Hagen ;
Schmidt, Martin .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 68
[9]  
Feldman N., 81 FACIAL LANDMARKS
[10]   GhostNet: More Features from Cheap Operations [J].
Han, Kai ;
Wang, Yunhe ;
Tian, Qi ;
Guo, Jianyuan ;
Xu, Chunjing ;
Xu, Chang .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :1577-1586