Remote Heart Rate Measurement from Highly Compressed Facial Videos: an End-to-end Deep Learning Solution with Video Enhancement

被引:222
作者
Yu, Zitong [1 ]
Peng, Wei [1 ]
Li, Xiaobai [1 ]
Hong, Xiaopeng [1 ,2 ,4 ]
Zhao, Guoying [1 ,3 ]
机构
[1] Univ Oulu, Ctr Machine Vis & Signal Anal, Oulu, Finland
[2] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, Xian, Peoples R China
[3] Northwest Univ, Sch Informat & Technol, Xian, Peoples R China
[4] Peng Cheng Lab, Shenzhen, Peoples R China
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
基金
中国国家自然科学基金; 芬兰科学院;
关键词
NONCONTACT;
D O I
10.1109/ICCV.2019.00024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Remote photoplethysmography (rPPG), which aims at measuring heart activities without any contact, has great potential in many applications (e.g., remote healthcare). Existing rPPG approaches rely on analyzing very fine details of facial videos, which are prone to be affected by video compression. Here we propose a two-stage, end-to-end method using hidden rPPG information enhancement and attention networks, which is the first attempt to counter video compression loss and recover rPPG signals from highly compressed videos. The method includes two parts: 1) a Spatio-Temporal Video Enhancement Network (STVEN) for video enhancement, and 2) an rPPG network (rPPGNet) for rPPG signal recovery. The rPPGNet can work on its own for robust rPPG measurement, and the STVEN network can be added and jointly trained to further boost the performance especially on highly compressed videos. Comprehensive experiments are performed on two benchmark datasets to show that, 1) the proposed method not only achieves superior performance on compressed videos with high-quality videos pair, 2) it also generalizes well on novel data with only compressed videos available, which implies the promising potential for real-world applications.
引用
收藏
页码:151 / 160
页数:10
相关论文
共 40 条
[1]  
[Anonymous], 2018, PROC IEEE C COMPUT V
[2]  
[Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.263
[3]  
[Anonymous], 2017, IEEE T AFFECTIVE COM
[4]  
[Anonymous], 2007, P 3 INT WORKSH VID P
[5]  
[Anonymous], 2017, COMP VIS ICCV 2017 I
[6]   Multi-task Convolutional Neural Network for Patient Detection and Skin Segmentation in Continuous Non-contact Vital Sign Monitoring [J].
Chaichulee, Sitthichok ;
Villarroel, Mauricio ;
Jorge, Joao ;
Arteta, Carlos ;
Green, Gabrielle ;
McCormick, Kenny ;
Zisserman, Andrew ;
Tarassenko, Lionel .
2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, :266-272
[7]  
Chen W., 2018, PROC EUR C COMPUT VI, P349
[8]   Robust Pulse Rate From Chrominance-Based rPPG [J].
de Haan, Gerard ;
Jeanne, Vincent .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2013, 60 (10) :2878-2886
[9]   Compression Artifacts Reduction by a Deep Convolutional Network [J].
Dong, Chao ;
Deng, Yubin ;
Loy, Chen Change ;
Tang, Xiaoou .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :576-584
[10]  
Galteri Leonardo, 2017, ICCV