Investigating Nuisances in DCNN-Based Face Recognition

Cited by: 8
Authors
Ferrari, Claudio [1]
Lisanti, Giuseppe [2]
Berretti, Stefano [1]
Del Bimbo, Alberto [1]
Affiliations
[1] Univ Florence, Dept Informat Engn, I-50139 Florence, Italy
[2] Univ Pavia, Dept Elect Comp & Biomed Engn, I-27100 Pavia, Italy
Keywords
Face recognition; deep learning; CNN architecture; distance measures
DOI
10.1109/TIP.2018.2861359
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Face recognition "in the wild" has been revolutionized by the deployment of deep learning-based approaches. In fact, it has been extensively demonstrated that deep convolutional neural networks (DCNNs) are powerful enough to overcome most of the limits that affected face recognition algorithms based on hand-crafted features. These include variations in illumination, pose, expression, and occlusion, to name a few. The discriminative power of DCNNs comes from the fact that low- and high-level representations are learned directly from the raw image data. As a consequence, we expect the performance of a DCNN to be influenced by the characteristics of the image/video data that are fed to the network, and by their preprocessing. In this paper, we present a thorough analysis of several aspects that affect the use of DCNNs for face recognition. The evaluation has been carried out from two main perspectives: the network architecture and the similarity measures used to compare deeply learned features; and the data (source and quality) and their preprocessing (bounding box and alignment). The results obtained on the IARPA Janus Benchmark-A, MegaFace, UMDFaces, and YouTube Faces data sets provide useful hints for designing, training, and testing DCNNs. Considering the outcomes of the experimental evaluation, we show how competitive performance with respect to the state of the art can be reached even with standard DCNN architectures and pipelines.
Pages: 5638-5651
Page count: 14