Multi-scale attention guided network for end-to-end face alignment and recognition

被引:5
|
作者
Shakeel, M. Saad [1 ]
Zhang, Yuxuan [2 ]
Wang, Xin [2 ]
Kang, Wenxiong [2 ]
Mahmood, Arif [3 ]
机构
[1] Guangdong Univ Petrochem Technol, Sch Automat, Maoming 525000, Peoples R China
[2] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 510641, Peoples R China
[3] Informat Technol Univ, Dept Comp Sci, Lahore 54000, Pakistan
关键词
Attention network; Feature alignment; Multi-scale features; Adaptive feature fusion;
D O I
10.1016/j.jvcir.2022.103628
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Attention modules embedded in deep networks mediate the selection of informative regions for object recog-nition. In addition, the combination of features learned from different branches of a network can enhance the discriminative power of these features. However, fusing features with inconsistent scales is a less-studied problem. In this paper, we first propose a multi-scale channel attention network with an adaptive feature fusion strategy (MSCAN-AFF) for face recognition (FR), which fuses the relevant feature channels and improves the network's representational power. In FR, face alignment is performed independently prior to recognition, which requires the efficient localization of facial landmarks, which might be unavailable in uncontrolled scenarios such as low-resolution and occlusion. Therefore, we propose utilizing our MSCAN-AFF to guide the Spatial Transformer Network (MSCAN-STN) to align feature maps learned from an unaligned training set in an end-to -end manner. Experiments on benchmark datasets demonstrate the effectiveness of our proposed MSCAN-AFF and MSCAN-STN.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Efficient Neural Network for Text Recognition in Natural Scenes Based on End-to-End Multi-Scale Attention Mechanism
    Peng, Huiling
    Yu, Jia
    Nie, Yalin
    ELECTRONICS, 2023, 12 (06)
  • [2] MSARN: A Multi-scale Attention Residual Network for End-to-End Environmental Sound Classification
    Fucai Hu
    Peng Song
    Ruhan He
    Zhaoli Yan
    Yongsheng Yu
    Neural Processing Letters, 2023, 55 : 11449 - 11465
  • [3] MSARN: A Multi-scale Attention Residual Network for End-to-End Environmental Sound Classification
    Hu, Fucai
    Song, Peng
    He, Ruhan
    Yan, Zhaoli
    Yu, Yongsheng
    NEURAL PROCESSING LETTERS, 2023, 55 (08) : 11449 - 11465
  • [4] An end-to-end face recognition method with alignment learning
    Tang, Fenggao
    Wu, Xuedong
    Zhu, Zhiyu
    Wan, Zhengang
    Chang, Yanchao
    Du, Zhaoping
    Gu, Lili
    OPTIK, 2020, 205
  • [5] A Convolutional Network With Multi-Scale and Attention Mechanisms for End-to-End Single-Channel Speech Enhancement
    Xiang, Xiaoxiao
    Zhang, Xiaojuan
    Chen, Haozhe
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1455 - 1459
  • [6] Toward End-to-End Face Recognition Through Alignment Learning
    Zhong, Yuanyi
    Chen, Jiansheng
    Huang, Bo
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (08) : 1213 - 1217
  • [7] Multi-Attention-Guided Cascading Network for End-to-End Person Search
    Yang, Jianxi
    Wang, Xiaoyong
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [8] End-to-End Speech Emotion Recognition Using Multi-Scale Convolution Networks
    Sivanagaraja, Tatinati
    Ho, Mun Kit
    Khong, Andy W. H.
    Wang, Yubo
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 189 - 192
  • [9] End-to-End Edge-Guided Multi-Scale Matching Network for Optical Satellite Stereo Image Pairs
    Luo, Yixin
    Wang, Hao
    Lv, Xiaolei
    REMOTE SENSING, 2024, 16 (05)
  • [10] AN END-TO-END MULTI-SCALE RESIDUAL RECONSTRUCTION NETWORK FOR IMAGE COMPRESSIVE SENSING
    Liu, Renhe
    Li, Sumei
    Hou, Chunping
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2070 - 2074