Accurate Remote PPG Waveform Recovery from Video Using a Multi-task Learning Temporal Model

被引:0
作者
Zhou, Fangshi [1 ]
Zhao, Tianming [1 ]
Holsinger, Adam [1 ,2 ]
Yao, Zhongmei [1 ]
机构
[1] Univ Dayton, Dept Comp Sci, Dayton, OH 45469 USA
[2] AFRL RYWA, Resilient & Agile Avion Branch, Force Res Lab, 2241 Avionics Circle Bldg 620, Wright Patterson AFB, OH 45433 USA
来源
ADVANCES IN VISUAL COMPUTING, ISVC 2024, PT I | 2025年 / 15046卷
关键词
Remote PPG; Waveform Recovery; Multi-task Learning; Peak and Trough Losses; Signal-to-Noise Ratio Loss;
D O I
10.1007/978-3-031-77392-1_37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Remote photoplethysmography (rPPG) offers a convenient, non-contact method for extracting cardiac-related signals from video. Despite its significant potential for comprehensive cardiac health monitoring, existing methods are limited to extracting heart rate because they can only recover heart-rate-correlated periodic patterns rather than the complete and precise PPG waveform needed for thorough biometric analysis. To address this issue, we designed a multi-loss model aimed at accurately restoring rPPG waveforms, focusing on capturing critical fiducial points and pulse contours. Our model employs a multi-task learning architecture that integrates primary rPPG signal reconstruction mean squared error (MSE) loss, peak loss, trough loss, and signal-to-noise ratio (SNR) loss to enhance signal recovery. Additionally, we incorporated Temporal Shift Modules (TSM) and Long Short-Term Memory (LSTM) networks to capture both short-term and long-term temporal dependencies, effectively handling low-quality or cross-dataset training data. The experimental results show that our model significantly improves rPPG signal restoration on the PURE and UBFC-rPPG datasets, outperforming two representative models, DeepPhys and TS-CAN, by reducing systolic peak and foot/onset estimation errors by over 30%, accurately capturing diastolic peaks and dicrotic notches, and achieving a DTW distance of 6.54, indicating enhanced waveform contour recovery.
引用
收藏
页码:493 / 506
页数:14
相关论文
共 18 条
[1]   Unsupervised skin tissue segmentation for remote photoplethysmography [J].
Bobbia, Serge ;
Macwan, Richard ;
Benezeth, Yannick ;
Mansouri, Alamin ;
Dubois, Julien .
PATTERN RECOGNITION LETTERS, 2019, 124 :82-90
[2]   DeepPhys: Video-Based Physiological Measurement Using Convolutional Attention Networks [J].
Chen, Weixuan ;
McDuff, Daniel .
COMPUTER VISION - ECCV 2018, PT II, 2018, 11206 :356-373
[3]   Prediction of vascular aging based on smartphone acquired PPG signals [J].
Dall'Olio, Lorenzo ;
Curti, Nico ;
Remondini, Daniel ;
Harb, Yosef Safi ;
Asselbergs, Folkert W. ;
Castellani, Gastone ;
Uh, Hae-Won .
SCIENTIFIC REPORTS, 2020, 10 (01)
[4]   Evaluation of biases in remote photoplethysmography methods [J].
Dasari, Ananyananda ;
Prakash, Sakthi Kumar Arul ;
Jeni, Laszlo A. ;
Tucker, Conrad S. .
NPJ DIGITAL MEDICINE, 2021, 4 (01)
[5]  
empeek, How medical device companies are transforming cardiac care with remote monitoring
[6]   DensePose: Dense Human Pose Estimation In The Wild [J].
Guler, Riza Alp ;
Neverova, Natalia ;
Kokkinos, Lasonas .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7297-7306
[7]   Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics [J].
Kendall, Alex ;
Gal, Yarin ;
Cipolla, Roberto .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7482-7491
[8]  
Liu Xiaoyu, 2024, Adv. Neural Inf. Process. Syst., V36
[9]  
Liu Xin, 2020, Advances in Neural Information Processing Systems, V33
[10]   Dual-GAN: Joint BVP and Noise Modeling for Remote Physiological Measurement [J].
Lu, Hao ;
Han, Hu ;
Zhou, S. Kevin .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12399-12408