A Real-Time and Long-Term Face Tracking Method Using Convolutional Neural Network and Optical Flow in IoT-Based Multimedia Communication Systems

被引:1
作者
Ren, Hanchi [1 ,2 ]
Hu, Yi [2 ]
Myint, San Hlaing [3 ]
Hou, Kun [1 ]
Zhang, Xiuyu [4 ]
Zuo, Min [1 ]
Zhang, Chi [2 ]
Zhang, Qingchuan [1 ]
Li, Haipeng [5 ]
机构
[1] Beijing Technol & Business Univ, Natl Engn Lab Agri Product Qual Traceabil, Beijing 100048, Peoples R China
[2] Swansea Univ, Swansea SA1 8EN, W Glam, Wales
[3] Waseda Univ, Global Informat & Telecommun Inst, Tokyo 1698050, Japan
[4] Guizhou Acad Testing & Anal, Guiyang 550000, Peoples R China
[5] Capinfo Co Ltd, Beijing 100010, Peoples R China
基金
北京市自然科学基金;
关键词
INTERNET; FEATURES;
D O I
10.1155/2021/6711561
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The development of the Internet of Things (IoT) stimulates many research works related to Multimedia Communication Systems (MCS), such as human face detection and tracking. This trend drives numerous progressive methods. Among these methods, the deep learning-based methods can spot face patch in an image effectively and accurately. Many people consider face tracking as face detection, but they are two different techniques. Face detection focuses on a single image, whose shortcoming is obvious, such as unstable and unsmooth face position when adopted on a sequence of continuous images; computing is expensive due to its heavy reliance on Convolutional Neural Networks (CNNs) and limited detection performance on the edge device. To overcome these defects, this paper proposes a novel face tracking strategy by combining CNN and optical flow, namely, C-OF, which achieves an extremely fast, stable, and long-term face tracking system. Two key things for commercial applications are the stability and smoothness of face positions in a sequence of image frames, which can provide more probability for face biological signal extraction, silent face antispoofing, and facial expression analysis in the fields of IoT-based MCS. Our method captures face patterns in every two consequent frames via optical flow to get rid of the unstable and unsmooth problems. Moreover, an innovative metric for measuring the stability and smoothness of face motion is designed and adopted in our experiments. The experimental results illustrate that our proposed C-OF outperforms both face detection and object tracking methods.
引用
收藏
页数:15
相关论文
共 65 条
[1]  
[Anonymous], 2022, IEEE T NETW SCI ENG, DOI DOI 10.1109/TNSE.2021.3101842
[2]  
[Anonymous], 2015, 2015 INT C BIOM SPEC, DOI DOI 10.1109/BIOSIG.2015.7314604
[3]  
Bolme DS, 2010, PROC CVPR IEEE, P2544, DOI 10.1109/CVPR.2010.5539960
[4]  
Bouguet J.-Y., 2001, Intel corporation, V5, P4, DOI DOI 10.1109/ICETET.2009.154
[5]   Real time face and object tracking as a component of a perceptual user interface [J].
Bradski, GR .
FOURTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION - WACV'98, PROCEEDINGS, 1998, :214-219
[6]   On the design of Cascades of boosted ensembles for face detection [J].
Brubaker, S. Charles ;
Wu, Jianxin ;
Sun, Jie ;
Mullin, Matthew D. ;
Rehg, James M. .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2008, 77 (1-3) :65-86
[7]   Multi-modal tracking of faces for video communications [J].
Crowley, JL ;
Berard, F .
1997 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1997, :640-645
[8]   Discriminative Scale Space Tracking [J].
Danelljan, Martin ;
Hager, Gustav ;
Khan, Fahad Shahbaz ;
Felsberg, Michael .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (08) :1561-1575
[9]   Nested Shallow CNN-Cascade for Face Detection in the Wild [J].
Deng, Jingjing ;
Xie, Xianghua .
2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, :165-172
[10]   Deep-Learning-Empowered Digital Forensics for Edge Consumer Electronics in 5G HetNets [J].
Ding, Feng ;
Zhu, Guopu ;
Alazab, Mamoun ;
Li, Xiangjun ;
Yu, Keping .
IEEE CONSUMER ELECTRONICS MAGAZINE, 2022, 11 (02) :42-50