AFNET-M: ADAPTIVE FUSION NETWORK WITH MASKS FOR 2D+3D FACIAL EXPRESSION RECOGNITION

被引:1
|
作者
Sui, Mingzhe [1 ]
Li, Hanting [1 ]
Zhu, Zhaoqing [1 ]
Zhao, Feng [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230027, Peoples R China
关键词
2D+3D facial expression recognition; mask attention; adaptive fusion; AFNet-M; FACE;
D O I
10.1109/ICIP49359.2023.10222441
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
2D+3D facial expression recognition (FER) can effectively cope with illumination and pose changes by merging texture and robust depth information. Most deep learning-based approaches employ the simple fusion strategy that concatenates the multimodal features directly after fully-connected layers, without considering the different degrees of significance for each modality. Meanwhile, how to focus more on both 2D and 3D local features is still a great challenge. In this paper, we propose the adaptive fusion network with masks (AFNet-M) for 2D+3D FER. To enhance 2D and 3D local features, we take the masks annotating salient regions of the face as prior knowledge and design the mask attention module (MA) which can automatically learn two modulation vectors to scale the feature maps. We also introduce an adaptive fusion module (AF) at convolutional layers through the computed importance weights. Experimental results demonstrate that our AFNet-M achieves the state-of-the-art performance on BU-3DFE and Bosphorus datasets and requires fewer parameters in comparison with other models.
引用
收藏
页码:116 / 120
页数:5
相关论文
共 50 条
  • [31] Multimodal 2D and 3D for In-the-wild Facial Expression Recognition
    Ly, Son Thai
    Nhu-Tai Do
    Lee, Guee-Sang
    Kim, Soo-Hyung
    Yang, Hyung-Jeong
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2927 - 2934
  • [32] Automatic 3D Facial Expression Recognition using Geometric and Textured Feature Fusion
    Jan, Asim
    Meng, Hongying
    2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG): EMOTION REPRESENTATION, ANALYSIS AND SYNTHESIS IN CONTINUOUS TIME AND SPACE (EMOSPACE 2015), VOL 5, 2015,
  • [33] Predicting Human Intentions from Motion Cues Only: A 2D+3D Fusion Approach
    Zunino, Andrea
    Cavazza, Jacopo
    Koul, Atesh
    Cavallo, Andrea
    Becchio, Cristina
    Murino, Vittorio
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 591 - 599
  • [34] 2D-to-3D Facial Expression Transfer
    Rotger, Gemma
    Lumbreras, Felipe
    Moreno-Noguer, Francesc
    Agudo, Antonio
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2008 - 2013
  • [35] Facial Expression Study Based on 3D Facial Emotion Recognition
    Cao, HongYuan
    Qi, Chao
    20TH INT CONF ON UBIQUITOUS COMP AND COMMUNICAT (IUCC) / 20TH INT CONF ON COMP AND INFORMATION TECHNOLOGY (CIT) / 4TH INT CONF ON DATA SCIENCE AND COMPUTATIONAL INTELLIGENCE (DSCI) / 11TH INT CONF ON SMART COMPUTING, NETWORKING, AND SERV (SMARTCNS), 2021, : 375 - 381
  • [36] Facial expression recognition using 3D facial feature distances
    Soyel, Hamit
    Demirel, Hasan
    IMAGE ANALYSIS AND RECOGNITION, PROCEEDINGS, 2007, 4633 : 831 - 838
  • [37] Expressive Maps for 3D Facial Expression Recognition
    Ocegueda, Omar
    Fang, Tianhong
    Shah, Shishir K.
    Kakadiaris, Ioannis A.
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [38] 3D Facial Expression Recognition Using Residues
    Srivastava, Ruchir
    Roy, Sujoy
    TENCON 2009 - 2009 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2009, : 1160 - +
  • [39] 3D Facial Expression Recognition with Geometrically Localized Facial Features
    Soyel, Hamit
    Demirel, Hasan
    23RD INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2008, : 229 - 232
  • [40] Effects on facial expression in 3D face recognition
    Chang, K
    Bowyer, K
    Flynn, P
    BIOMETRIC TECHNOLOGY FOR HUMAN IDENTIFICATION II, 2005, 5779 : 132 - 143