RADIANT: Better rPPG estimation using signal embeddings and Transformer

被引：38

作者：

Gupta, Anup Kumar ^{[1
]}

Kumar, Rupesh ^{[1
]}

Birla, Lokendra ^{[1
]}

Gupta, Puneet ^{[1
]}

机构：

[1] Indian Inst Technol Indore, Indore, India

来源：

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2023年

关键词：

D O I：

10.1109/WACV56688.2023.00495

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Remote photoplethysmography can provide non-contact heart rate (HR) estimation by analyzing the skin color variations obtained from face videos. These variations are subtle, imperceptible to human eyes, and easily affected by noise. Existing deep learning-based rPPG estimators are incompetent due to three reasons. Firstly, they suppress the noise by utilizing information from the whole face even though different facial regions contain different noise characteristics. Secondly, local noise characteristics inherently affect the convolutional neural network (CNN) architectures. Lastly, the CNN sequential architectures fail to preserve long temporal dependencies. To address these issues, we propose RADIANT, that is, rPPG estimation using Signal Embeddings and Transformer. Our architecture utilizes a multi-head attention mechanism that facilitates feature subspace learning to extract the multiple correlations among the color variations corresponding to the periodic pulse. Also, its global information processing ability helps to suppress local noise characteristics. Furthermore, we propose novel signal embedding to enhance the rPPG feature representation and suppress noise. We have also improved the generalization of our architecture by adding a new training set. To this end, the effectiveness of synthetic temporal signals and data augmentations were explored. Experiments on extensively utilized rPPG datasets demonstrate that our architecture outperforms previous well-known architectures. Code: https://github.com/Deep-Intelligence-Lab/RADIANT.git

引用

页码：4965 / 4975

页数：11

共 62 条

[1]

[Anonymous], 2002, INT C IM PROC

[2]

[Anonymous], 2020, EUR C COMP VIS, DOI DOI 10.2112/SI102-005.1

[3]

[Anonymous], 2021, IEEE T AFFECTIVE COM, DOI DOI 10.1057/S41310-021-00114-4

[4]

Ba J.L., 2016, Layer Normalization

[5] Detecting Pulse from Head Motions in Video [J].

Balakrishnan, Guha ;

Durand, Fredo ;

Guttag, John .

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :3430-3437

[6] PATRON: Exploring respiratory signal derived from non-contact face videos for face anti-spoofing [J].

Birla, Lokendra ;

Gupta, Puneet .

EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187

[7]

Birla Lokendra, 2022, IEEE T DEPENDABLE SE

[8]

Birla Lokendra, 2021, COMPUTERS BIOL MED

[9]

Bobbia Serge, 2019, Pattern Recognition Letters, P82, DOI DOI 10.1016/J.PATREC.2017.10.017

[10] 3D Convolutional Neural Networks for Remote Pulse Rate Measurement and Mapping from Facial Video [J].

Bousefsaf, Frederic ;

Pruski, Alain ;

Maaoui, Choubeila .

APPLIED SCIENCES-BASEL, 2019, 9 (20)

← 1 2 3 4 5 6 7 →