Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition

Cited by: 3
Authors
Xie, Weicheng [1]
Peng, Zhibin [1]
Shen, Linlin [1]
Lu, Wenya [1]
Zhang, Yang [1]
Song, Siyang [2]
Affiliations
[1] Shenzhen Univ, Shenzhen Inst Artificial Intelligence, Sch Comp Sci & Software Engn, Comp Vis Inst,Guangdong Key Lab Intelligent Inform, Shenzhen 518060, Peoples R China
[2] Univ Cambridge, Dept Comp Sci & Technol, Cambridge CB2 1TN, England
Keywords
Semantics; Cross layer design; Face recognition; Self-supervised learning; Representation learning; Faces; Task analysis; Facial expression recognition; contrastive learning; latent semantic alignment; multi-layer attention; NETWORK; ATTENTION;
DOI
10.1109/TIP.2024.3378459
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Convolutional neural networks (CNNs) have significantly improved facial expression recognition. However, current training still suffers from inconsistent learning intensities among layers, i.e., the feature representations in shallow layers are not learned as sufficiently as those in deep layers. To this end, this work proposes a contrastive learning framework to align the feature semantics of shallow and deep layers, followed by an attention module that represents the multi-scale features in a weight-adaptive manner. The proposed algorithm has three main merits. First, the learning intensity of the shallow-layer features, defined as the magnitude of the backpropagation gradient, is enhanced by cross-layer contrastive learning. Second, the latent semantics in the shallow-layer and deep-layer features are explored and aligned during contrastive learning, so that the fine-grained characteristics of expressions are taken into account in feature representation learning. Third, by integrating multi-scale features from multiple layers with an attention module, the algorithm achieves state-of-the-art performance of 92.21%, 89.50%, and 62.82% on three in-the-wild expression databases (RAF-DB, FERPlus, and SFEW, respectively), and the second-best performance of 65.29% on the AffectNet dataset. Our code will be made publicly available.
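The abstract does not give the loss used for cross-layer alignment, so the following is only a minimal sketch of one common choice, an InfoNCE-style contrastive loss, and not the authors' formulation. It assumes the shallow-layer and deep-layer features have already been pooled and projected to the same dimension; the names `l2_normalize`, `info_nce`, and `temperature` are hypothetical and introduced here for illustration.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def info_nce(shallow, deep, temperature=0.1):
    """Mean InfoNCE loss over a batch (illustrative, not the paper's loss).

    shallow[i] and deep[i] are assumed to be same-dimension feature vectors
    of the same image (the positive pair); deep[j] with j != i serves as a
    negative for shallow[i].
    """
    s = [l2_normalize(v) for v in shallow]
    d = [l2_normalize(v) for v in deep]
    losses = []
    for i, si in enumerate(s):
        # Cosine similarity of shallow feature i to every deep feature.
        logits = [sum(a * b for a, b in zip(si, dj)) / temperature for dj in d]
        # Numerically stable log-sum-exp for the softmax denominator.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(z - m) for z in logits))
        losses.append(log_denom - logits[i])
    return sum(losses) / len(losses)


# Toy features: "aligned" pairs each shallow vector with a similar deep one,
# "swapped" pairs it with the dissimilar one.
shallow = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[0.9, 0.1], [0.1, 0.9]]
swapped = [aligned[1], aligned[0]]
print(info_nce(shallow, aligned) < info_nce(shallow, swapped))  # prints True
```

The loss is small when each shallow feature is closest to the deep feature of the same image and large otherwise, which is the sense in which a contrastive objective can pull shallow-layer and deep-layer semantics together.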
Pages: 2514-2529
Number of pages: 16
Related papers
50 in total
  • [41] A Novel Multi-Feature Joint Learning Ensemble Framework for Multi-Label Facial Expression Recognition
    Li, Wanzhao
    Luo, Mingyuan
    Zhang, Peng
    Huang, Wei
    IEEE ACCESS, 2021, 9 : 119766 - 119777
  • [42] LAUNet: A Latent Action Units Network for Facial Expression Recognition
    Zhang, Junlin
    Hirota, Kaoru
    Dai, Yaping
    Yin, Sijie
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 2513 - 2518
  • [43] Machine Learning based Efficient Facial Expression Recognition Algorithm
    Akram, Noreen
    Butt, Rizwan Aslam
    Akram, Ambreen
    Zaidi, Syed Rehan Ali
    2022 GLOBAL CONFERENCE ON WIRELESS AND OPTICAL TECHNOLOGIES (GCWOT), 2022, : 51 - 58
  • [44] Local and correlation attention learning for subtle facial expression recognition
    Wang, Shaocong
    Yuan, Yuan
    Zheng, Xiangtao
    Lu, Xiaoqiang
    NEUROCOMPUTING, 2021, 453 : 742 - 753
  • [45] Cross-modal contrastive learning for multimodal sentiment recognition
    Yang, Shanliang
    Cui, Lichao
    Wang, Lei
    Wang, Tao
    APPLIED INTELLIGENCE, 2024, 54 (05) : 4260 - 4276
  • [46] Learning Informative and Discriminative Features for Facial Expression Recognition in the Wild
    Li, Yingjian
    Lu, Yao
    Chen, Bingzhi
    Zhang, Zheng
    Li, Jinxing
    Lu, Guangming
    Zhang, David
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 3178 - 3189
  • [47] MSL-FER: MIRRORED SELF-SUPERVISED LEARNING FOR FACIAL EXPRESSION RECOGNITION
    Pan, Xiangshuai
    Liu, Weifeng
    Wang, Yanjiang
    Lu, Xiaoping
    Liu, Baodi
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1601 - 1605
  • [48] Facial expression recognition based on active region of interest using deep learning and parallelism
    Hossain, Mohammad Alamgir
    Assiri, Basem
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [49] Cross-modal contrastive learning for multimodal sentiment recognition
    Shanliang Yang
    Lichao Cui
    Lei Wang
    Tao Wang
    Applied Intelligence, 2024, 54 : 4260 - 4276
  • [50] Learning from More: Combating Uncertainty Cross-multidomain for Facial Expression Recognition
    Liu, Hanwei
    Cai, Huiling
    Lin, Qingcheng
    Li, Xuefeng
    Xiao, Hui
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5889 - 5898