Remote heart rate estimation via convolutional neural networks with transformers

被引:2
作者
Guo, Zongheng [1 ]
Chen, Huahua [1 ]
Lin, Lili [2 ]
Zhou, Wenhui [3 ]
Yang, Meng [1 ]
Ying, Na [1 ]
Guo, Chunsheng [1 ]
机构
[1] Hangzhou Dianzi Univ, Sch Commun Engn, Hangzhou, Peoples R China
[2] Zhejiang Gongshang Univ, Sch Informat & Elect Engn, Hangzhou, Peoples R China
[3] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou, Peoples R China
来源
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS | 2023年 / 360卷 / 17期
关键词
NONCONTACT;
D O I
10.1016/j.jfranklin.2023.10.013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Remote photoplethysmography (rPPG) uses subtle color changes in facial videos to estimate heart rate(HR). However, recent methods face challenges in solving remote heart rate estimation tasks, because the color changes in facial skin are very subtle and the pseudo-periodicity of rPPG requires long-distance temporal detection. To address these issues of the rPPG estimation task, we propose a convolution neural network with transformers for rPPG estimation which takes the advantages of convolutions in the locality and transformers in long-range dependencies. Specifically, we first proposed a local feed-forward module following the multi-head self-attention to compensate for the difficulties of transformers in capturing neighboring feature information. Then we add the relative and absolute position encoding to obtain the ordering of the tokens which is the key to capturing the pseudo-periodicity of rPPG. Furthermore, the temporal multi-scale module is proposed to learn the temporal information from different scales. Extensive experimental results demonstrate that our method is superior to the state-of-the-art results on the COHFACE, UBFC-rPPG, PURE, and VIPL-HR databases compared with traditional methods and deep learning-based methods. We also evaluated our method under different conditions on the VIPL-HR database, and the results show that our method is robust to various conditions. (c) 2023 The Franklin Institute. Published by Elsevier Inc. All rights reserved.
引用
收藏
页码:13149 / 13165
页数:17
相关论文
共 48 条
  • [1] Unsupervised skin tissue segmentation for remote photoplethysmography
    Bobbia, Serge
    Macwan, Richard
    Benezeth, Yannick
    Mansouri, Alamin
    Dubois, Julien
    [J]. PATTERN RECOGNITION LETTERS, 2019, 124 : 82 - 90
  • [2] 3D Convolutional Neural Networks for Remote Pulse Rate Measurement and Mapping from Facial Video
    Bousefsaf, Frederic
    Pruski, Alain
    Maaoui, Choubeila
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (20):
  • [3] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
    Carreira, Joao
    Zisserman, Andrew
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4724 - 4733
  • [4] DeepPhys: Video-Based Physiological Measurement Using Convolutional Attention Networks
    Chen, Weixuan
    McDuff, Daniel
    [J]. COMPUTER VISION - ECCV 2018, PT II, 2018, 11206 : 356 - 373
  • [5] Robust Pulse Rate From Chrominance-Based rPPG
    de Haan, Gerard
    Jeanne, Vincent
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2013, 60 (10) : 2878 - 2886
  • [6] Gehring J, 2017, PR MACH LEARN RES, V70
  • [7] Gideon John, 2021, Proceedings of the IEEE/CVF international conference on computer vision, P3995
  • [8] CMT: Convolutional Neural Networks Meet Vision Transformers
    Guo, Jianyuan
    Han, Kai
    Wu, Han
    Tang, Yehui
    Chen, Xinghao
    Wang, Yunhe
    Xu, Chang
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12165 - 12175
  • [9] MOMBAT: Heart rate monitoring from face video using pulse modeling and Bayesian tracking
    Gupta, Puneet
    Bhowmick, Brojeshwar
    Pal, Arpan
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 121
  • [10] Bag of Tricks for Image Classification with Convolutional Neural Networks
    He, Tong
    Zhang, Zhi
    Zhang, Hang
    Zhang, Zhongyue
    Xie, Junyuan
    Li, Mu
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 558 - 567