rPPG-Based Heart Rate Estimation Using Spatial-Temporal Attention Network

被引:17
作者
Hu, Min [1 ]
Guo, Dong [1 ]
Jiang, Mingxing [1 ,2 ]
Qian, Fei [1 ]
Wang, Xiaohua [1 ]
Ren, Fuji [1 ,3 ]
机构
[1] Hefei Univ Technol, Sch Comp & Informat, Anhui Prov Key Lab Affect Comp & Adv Intelligent M, Hefei 230602, Peoples R China
[2] Sch Commun & Informat Engn, Shanghai Univ, Shanghai 200444, Peoples R China
[3] Univ Tokushima, Grad Sch Adv Technol & Sci, Tokushima 7708502, Japan
基金
中国国家自然科学基金;
关键词
Heart rate; Videos; Cameras; Neural networks; Estimation; Deep learning; Physiology; Spatiotemporal phenomena; Photoplethysmography; Deep neural networks; heart rate (HR) estimation; remote photoplethysmography (rPPG); spatial-temporal attention; REMOTE PHOTOPLETHYSMOGRAPHY; NONCONTACT; PPG;
D O I
10.1109/TCDS.2021.3131197
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Remote photoplethysmography (rPPG) based on the computer vision technology is widely used to calculate the heart rate (HR) from facial videos. Existing rPPG techniques have been subject to some limitations [e.g., highly redundant spatial information, head movement noise, and region of interest (ROI) selection]. To address these limitations, this article introduces an effective spatial-temporal attention network. A temporal fusion module is first proposed to fully exploit the time-domain information, aiming to reduce the redundant video information and strengthen the association relationships of long-range videos. Furthermore, a spatial attention mechanism is designed in the backbone net to precisely target the skin ROIs. Finally, to assist the network in learning the weights between channels, we project the RGB images using the plane orthogonal to skin (POS) algorithm and add motion representation to complement physiological signals' extraction. The extensive experiments on the public PURE, MMSE-HR, and UBFC-rPPG datasets demonstrate that our model achieves competitive results compared with other methods.
引用
收藏
页码:1630 / 1641
页数:12
相关论文
共 59 条
[31]  
Niu XS, 2018, INT C PATT RECOG, P3580, DOI 10.1109/ICPR.2018.8546321
[32]   PP-Net: A Deep Learning Framework for PPG-Based Blood Pressure and Heart Rate Estimation [J].
Panwar, Madhuri ;
Gautam, Arvind ;
Biswas, Dwaipayan ;
Acharyya, Amit .
IEEE SENSORS JOURNAL, 2020, 20 (17) :10000-10011
[33]   HeartTrack: Convolutional neural network for remote video-based heart rate monitoring [J].
Perepelkina, Olga ;
Artemyev, Mikhail ;
Churikova, Marina ;
Grinenko, Mikhail .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :1163-1171
[34]   Advancements in Noncontact, Multiparameter Physiological Measurements Using a Webcam [J].
Poh, Ming-Zher ;
McDuff, Daniel J. ;
Picard, Rosalind W. .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2011, 58 (01) :7-11
[35]   Non-contact, automated cardiac pulse measurements using video imaging and blind source separation [J].
Poh, Ming-Zher ;
McDuff, Daniel J. ;
Picard, Rosalind W. .
OPTICS EXPRESS, 2010, 18 (10) :10762-10774
[36]   EVM-CNN: Real-Time Contactless Heart Rate Estimation From Facial Video [J].
Qiu, Ying ;
Liu, Yang ;
Arteaga-Falconi, Juan ;
Dong, Haiwei ;
El Saddik, Abdulmotaleb .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (07) :1778-1787
[37]  
Selvaraju RR, 2020, INT J COMPUT VISION, V128, P336, DOI [10.1007/s11263-019-01228-7, 10.1109/ICCV.2017.74]
[38]  
Song R., 2020, IEEE Trans. Instrum. Meas., V70, P1
[39]   Heart Rate Estimation From Facial Videos Using a Spatiotemporal Representation With Convolutional Neural Networks [J].
Song, Rencheng ;
Zhang, Senle ;
Li, Chang ;
Zhang, Yunfei ;
Cheng, Juan ;
Chen, Xun .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2020, 69 (10) :7411-7421
[40]   New insights on super-high resolution for video-based heart rate estimation with a semi-blind source separation method [J].
Song, Rencheng ;
Zhang, Senle ;
Cheng, Juan ;
Li, Chang ;
Chen, Xun .
COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 116