A Two-Stream Deep-Learning Network for Heart Rate Estimation From Facial Image Sequence

Cited by: 1
Authors
Lie, Wen-Nung [1 ]
Le, Dao Q. [1 ,2 ,3 ]
Huang, Po-Han [1 ]
Fu, Guan-Hao [1 ]
Quynh, Anh Nguyen Thi [2 ]
Nhu, Quynh Nguyen Quang [2 ]
Affiliations
[1] Natl Chung Cheng Univ, Dept Elect Engn, Chiayi 62102, Taiwan
[2] Univ Da Nang, Univ Sci & Technol, Fac Adv Sci & Technol FAST, Da Nang 550000, Vietnam
[3] Wistron NeWeb Corp WNC, Hsinchu 300, Taiwan
Keywords
Estimation; Streams; Feature extraction; Modulation; Heart rate; Monitoring; Convolutional neural networks; Computational modeling; Training; Convolution; Deep-learning (DL); facial image sequence; heart rate (HR) estimation; remote PPG; two-stream network; REMOTE; NONCONTACT;
DOI
10.1109/JSEN.2024.3483629
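Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes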
0808 ; 0809 ;
Abstract
This article presents a deep-learning-based two-stream network that estimates the remote photoplethysmogram (rPPG) signal, and hence derives the heart rate (HR), from an RGB facial video. The proposed network employs temporal modulation blocks (TMBs) to efficiently extract temporal dependencies, and spatial attention blocks applied to a mean frame to learn spatial features. Each TMB is composed of two subblocks that simultaneously learn overall and channelwise spatiotemporal features, which are pivotal for the task. Data augmentation (DA) in training and multiple redundant estimations for noise removal in testing were also designed to make training more effective and inference more robust. Experimental results show that the proposed temporal shift-channelwise spatio-temporal network (TS-CST Net) achieves performance competitive with, and in some cases superior to, state-of-the-art (SOTA) methods on four popular datasets, showcasing the network's learning capability.
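The abstract states that HR is derived from the estimated rPPG signal and that several redundant estimates are combined at test time, but this record gives no details of either step. The following is only a minimal sketch of one common way to do this, not the authors' method: it scans candidate heart rates, picks the one with the highest spectral power, and takes the median over overlapping windows. The function names, the 42-240 bpm search band, and the window and step sizes are all assumptions made for illustration.

```python
import math
import statistics


def estimate_hr_bpm(signal, fps, lo_bpm=42, hi_bpm=240):
    """Return the integer bpm in [lo_bpm, hi_bpm] with the highest spectral power.

    Hypothetical helper: the search band and 1 bpm step are assumptions.
    """
    mean = sum(signal) / len(signal)
    centered = [s - mean for s in signal]  # remove the DC component
    best_bpm, best_power = lo_bpm, -1.0
    for bpm in range(lo_bpm, hi_bpm + 1):
        omega = 2.0 * math.pi * (bpm / 60.0) / fps  # radians per frame
        re = sum(x * math.cos(omega * t) for t, x in enumerate(centered))
        im = sum(x * math.sin(omega * t) for t, x in enumerate(centered))
        power = re * re + im * im
        if power > best_power:
            best_bpm, best_power = bpm, power
    return best_bpm


def median_hr_bpm(signal, fps, window, step):
    """Median of per-window HR estimates over overlapping windows.

    Hypothetical stand-in for "multiple redundant estimations"; the paper
    may combine its estimates differently.
    """
    starts = range(0, len(signal) - window + 1, step)
    estimates = [estimate_hr_bpm(signal[s:s + window], fps) for s in starts]
    return statistics.median(estimates)


# Synthetic rPPG-like signal: 72 bpm (1.2 Hz) sampled at 30 fps for 10 s.
fps = 30
rppg = [0.5 + math.sin(2.0 * math.pi * 1.2 * t / fps) for t in range(300)]
hr_single = estimate_hr_bpm(rppg, fps)
hr_median = median_hr_bpm(rppg, fps, window=150, step=30)
```

Taking the median rather than the mean keeps a single window corrupted by motion or illumination change from shifting the final estimate, which is the general motivation for redundant estimation.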
Pages: 42343-42351
Page count: 9
Related papers
51 items in total
[1]   DeepPhys: Video-Based Physiological Measurement Using Convolutional Attention Networks [J].
Chen, Weixuan ;
McDuff, Daniel .
COMPUTER VISION - ECCV 2018, PT II, 2018, 11206 :356-373
[2]   Remote Heart Rate Measurement From Near-Infrared Videos Based on Joint Blind Source Separation With Delay-Coordinate Transformation [J].
Cheng, Juan ;
Wang, Ping ;
Song, Rencheng ;
Liu, Yu ;
Li, Chang ;
Liu, Yong ;
Chen, Xun .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70 (70)
[3]   Illumination Variation-Resistant Video-Based Heart Rate Measurement Using Joint Blind Source Separation and Ensemble Empirical Mode Decomposition [J].
Cheng, Juan ;
Chen, Xun ;
Xu, Lingxi ;
Wang, Z. Jane .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2017, 21 (05) :1422-1433
[4]   Non-Contact Heart Rate Measurement From Facial Video Data Using a 2D-VMD Scheme [J].
Das, Mousumi ;
Choudhary, Tilendra ;
Bhuyan, M. K. ;
Sharma, L. N. .
IEEE SENSORS JOURNAL, 2022, 22 (11) :11153-11161
[5]   Robust Pulse Rate From Chrominance-Based rPPG [J].
de Haan, Gerard ;
Jeanne, Vincent .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2013, 60 (10) :2878-2886
[6]  
Dosso YS, 2018, IEEE INT SYM MED MEA, P642
[7]   SlowFast Networks for Video Recognition [J].
Feichtenhofer, Christoph ;
Fan, Haoqi ;
Malik, Jitendra ;
He, Kaiming .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6201-6210
[8]   The Way to my Heart is through Contrastive Learning: Remote Photoplethysmography from Unlabelled Video [J].
Gideon, John ;
Stent, Simon .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :3975-3984
[9]  
Guo YH, 2019, AAAI CONF ARTIF INTE, P8368
[10]   Robust Heart Rate Estimation With Spatial-Temporal Attention Network From Facial Videos [J].
Hu, Min ;
Qian, Fei ;
Wang, Xiaohua ;
He, Lei ;
Guo, Dong ;
Ren, Fuji .
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (02) :639-647