A Two-Stream Deep-Learning Network for Heart Rate Estimation From Facial Image Sequence

Cited by: 1
Authors
Lie, Wen-Nung [1]
Le, Dao Q. [1, 2, 3]
Huang, Po-Han [1]
Fu, Guan-Hao [1]
Quynh, Anh Nguyen Thi [2]
Nhu, Quynh Nguyen Quang [2]
Affiliations
[1] Natl Chung Cheng Univ, Dept Elect Engn, Chiayi 62102, Taiwan
[2] Univ Da Nang, Univ Sci & Technol, Fac Adv Sci & Technol FAST, Da Nang 550000, Vietnam
[3] Wistron NeWeb Corp WNC, Hsinchu 300, Taiwan
Keywords
Estimation; Streams; Feature extraction; Modulation; Heart rate; Monitoring; Convolutional neural networks; Computational modeling; Training; Convolution; Deep-learning (DL); facial image sequence; heart rate (HR) estimation; remote PPG; two-stream network; REMOTE; NONCONTACT
DOI
10.1109/JSEN.2024.3483629
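Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract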
This article presents a deep-learning-based two-stream network that estimates a remote photoplethysmogram (rPPG) signal, and hence derives the heart rate (HR), from an RGB facial video. Our proposed network employs temporal modulation blocks (TMBs) to efficiently extract temporal dependencies and spatial attention blocks on a mean frame to learn spatial features. Our TMBs are composed of two subblocks that can simultaneously learn overall and channelwise spatiotemporal features, which are pivotal for the task. Data augmentation (DA) in training and multiple redundant estimations for noise removal in testing were also designed to make the training more effective and the inference more robust. Experimental results show that the proposed temporal shift-channelwise spatio-temporal network (TS-CST Net) achieves competitive and even superior performance compared with state-of-the-art (SOTA) methods on four popular datasets, showcasing our network's learning capability.
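The abstract does not say how the HR is derived from the estimated rPPG signal, nor how the redundant test-time estimations are combined. As a rough illustration only, the sketch below uses a common approach that is an assumption here, not the authors' method: pick the strongest spectral peak in a plausible HR band, then take the median over overlapping windows. The function names, the 42 to 240 bpm band, and the window settings are all hypothetical.

```python
import math
from statistics import median


def estimate_hr_bpm(rppg, fps, lo_bpm=42.0, hi_bpm=240.0, step_bpm=0.5):
    """Return the bpm value whose periodogram power is largest in [lo_bpm, hi_bpm].

    Assumption: HR is read off as the dominant spectral peak of the rPPG trace.
    """
    n = len(rppg)
    if n < 2:
        raise ValueError("need at least two samples")
    mean = sum(rppg) / n
    # Remove the DC offset and apply a Hann taper to limit spectral leakage.
    x = [(v - mean) * (0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)))
         for i, v in enumerate(rppg)]
    best_bpm, best_power = lo_bpm, -1.0
    steps = int(round((hi_bpm - lo_bpm) / step_bpm))
    for k in range(steps + 1):
        bpm = lo_bpm + k * step_bpm
        w = 2 * math.pi * (bpm / 60.0) / fps  # angular frequency, radians per sample
        re = sum(v * math.cos(w * i) for i, v in enumerate(x))
        im = sum(v * math.sin(w * i) for i, v in enumerate(x))
        power = re * re + im * im
        if power > best_power:
            best_bpm, best_power = bpm, power
    return best_bpm


def robust_hr_bpm(rppg, fps, window_s=10.0, stride_s=2.0):
    """Median of per-window HR estimates.

    Assumption: a simple stand-in for combining redundant estimations.
    """
    win = int(window_s * fps)
    step = max(1, int(stride_s * fps))
    if len(rppg) < win:
        raise ValueError("signal is shorter than one window")
    estimates = [estimate_hr_bpm(rppg[s:s + win], fps)
                 for s in range(0, len(rppg) - win + 1, step)]
    return median(estimates)
```

Taking the median rather than the mean keeps a few windows corrupted by motion or illumination changes from dominating the final estimate, which is one plausible reading of "redundant estimations for noise removal".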
Pages: 42343-42351
Page count: 9