HEART RATE AND OXYGEN SATURATION ESTIMATION FROM FACIAL VIDEO WITH MULTIMODAL PHYSIOLOGICAL DATA GENERATION

被引:3
作者
Akamatsu, Yusuke [1 ]
Onishi, Yoshifumi [1 ]
Imaoka, Hitoshi [1 ]
机构
[1] NEC Corp Ltd, Biometr Res Labs, Tokyo, Japan
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年
关键词
Telemedicine; remote photoplethysmography (rPPG); heart rate; oxygen saturation; multimodal generative model;
D O I
10.1109/ICASSP43922.2022.9747109
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Efforts to estimate multiple physiological parameters such as heart rate and oxygen saturation from facial videos have been made. However, training robust machine learning models for the estimation is challenging without large multimodal physiological datasets containing multiple physiological parameters and facial videos. In this paper, we propose a method to estimate heart rate and oxygen saturation from facial videos with multimodal physiological data generation. To collect sufficient datasets, the proposed method generates multimodal physiological datasets from several datasets containing a part of physiological modalities. Furthermore, to accurately estimate physiological parameters for unseen subjects, i:e:, not included in the training data, we generate a multimodal physiological dataset for unseen subjects by using short facial videos of unseen subjects. Experimental results using three public datasets show the effectiveness of our multimodal physiological data generation.
引用
收藏
页码:1111 / 1115
页数:5
相关论文
共 21 条
[1]   CLASSIFICATION OF EXPERT-NOVICE LEVEL USING EYE TRACKING AND MOTION DATA VIA CONDITIONAL MULTIMODAL VARIATIONAL AUTOENCODER [J].
Akamatsu, Yusuke ;
Maeda, Keisuke ;
Ogawa, Takahiro ;
Haseyama, Miki .
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :1360-1364
[2]  
[Anonymous], 2018, ARXIV180205335
[3]  
Boccignone Giuseppe, 2020, IEEE ACCESS, V8
[4]  
Casalino G, 2020, IEEE SYMP COMP COMMU, P823
[5]   DeepPhys: Video-Based Physiological Measurement Using Convolutional Attention Networks [J].
Chen, Weixuan ;
McDuff, Daniel .
COMPUTER VISION - ECCV 2018, PT II, 2018, 11206 :356-373
[6]   Robust Pulse Rate From Chrominance-Based rPPG [J].
de Haan, Gerard ;
Jeanne, Vincent .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2013, 60 (10) :2878-2886
[7]  
Kingma DP, 2014, ADV NEUR IN, V27
[8]  
Kopeliovich M., 2019, P IEEE CVF INT C COM
[9]  
Lewandowska M., 2011, 2011 Federated Conference on Computer Science and Information Systems (FedCSIS), P405
[10]  
Liu Xin, 2020, ARXIV200603790