Multimodal facial biometrics recognition: Dual-stream convolutional neural networks with multi-feature fusion layers

Cited by: 27
Authors
Tiong, Leslie Ching Ow [1]
Kim, Seong Tae [2]
Ro, Yong Man [3]
Affiliations
[1] Korea Inst Sci & Technol KIST, Computat Sci Res Ctr, 5 Hwarang Ro,14 Gil Seongbuk Gu, Seoul 02792, South Korea
[2] Tech Univ Munich, Comp Aided Med Procedures, Boltzmanstr 3, D-85748 Garching, Germany
[3] Korea Adv Inst Sci & Technol KAIST, Image & Video Syst Lab, 291 Daehak Ro, Daejeon 34141, South Korea
Keywords
Multimodal facial biometrics recognition; Deep multimodal learning; Dual-stream convolutional neural network; Network fusion layers;
DOI
10.1016/j.imavis.2020.103977
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Facial recognition for surveillance applications remains challenging in uncontrolled environments, especially when faces are covered by masks or veils and when subjects differ in ethnicity. Multimodal facial biometrics recognition has become a major line of research for handling such scenarios. However, to work with multimodal facial biometrics, many existing deep learning networks rely on feature concatenation or weight combination to construct a representation layer for the recognition task. This concatenation is often inefficient, as it does not effectively exploit the multimodal data to improve recognition performance. Therefore, this paper proposes multi-feature fusion layers for multimodal facial biometrics, leading to more significant and informative data learning in dual-stream convolutional neural networks. Specifically, the network consists of two progressive parts with distinct fusion strategies that aggregate RGB data and texture descriptors for multimodal facial biometrics. We demonstrate that the proposed network offers a discriminative feature representation and benefits from the multi-feature fusion layers in terms of accuracy. We also introduce and share a new dataset of multimodal facial biometric data, the Ethnic-facial dataset, for benchmarking. In addition, four publicly accessible datasets, namely AR, FaceScrub, IMDB_WIKI, and YouTube Face, are used to evaluate the proposed network. In our experimental analysis, the proposed network outperformed several competing networks on these datasets for both recognition and verification tasks. © 2020 Elsevier B.V. All rights reserved.
Pages: 10