Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition

Cited by: 3
Authors
Xie, Weicheng [1]
Peng, Zhibin [1]
Shen, Linlin [1]
Lu, Wenya [1]
Zhang, Yang [1]
Song, Siyang [2]
Affiliations
[1] Shenzhen Univ, Shenzhen Inst Artificial Intelligence, Sch Comp Sci & Software Engn, Comp Vis Inst,Guangdong Key Lab Intelligent Inform, Shenzhen 518060, Peoples R China
[2] Univ Cambridge, Dept Comp Sci & Technol, Cambridge CB2 1TN, England
Keywords
Semantics; Cross layer design; Face recognition; Self-supervised learning; Representation learning; Faces; Task analysis; Facial expression recognition; contrastive learning; latent semantic alignment; multi-layer attention; NETWORK; ATTENTION;
DOI
10.1109/TIP.2024.3378459
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Convolutional neural networks (CNNs) have significantly improved facial expression recognition. However, current training still suffers from inconsistent learning intensities across layers: the feature representations in shallow layers are not learned as sufficiently as those in deep layers. To address this, this work proposes a contrastive learning framework that aligns the feature semantics of shallow and deep layers, followed by an attention module that represents the multi-scale features in a weight-adaptive manner. The proposed algorithm has three main merits. First, cross-layer contrastive learning enhances the learning intensity of the shallow-layer features, where learning intensity is defined as the magnitude of the backpropagation gradient. Second, the latent semantics in the shallow-layer and deep-layer features are explored and aligned during contrastive learning, so the fine-grained characteristics of expressions are taken into account in feature representation learning. Third, by integrating the multi-scale features from multiple layers with an attention module, the algorithm achieves state-of-the-art performance of 92.21%, 89.50%, and 62.82% on three in-the-wild expression databases (RAF-DB, FERPlus, and SFEW, respectively) and the second-best performance of 65.29% on the AffectNet dataset. The code will be made publicly available.
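The two ideas named in the abstract, aligning shallow and deep features by contrast and fusing multi-layer features with adaptive weights, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the loss or the attention design, so the sketch substitutes a generic InfoNCE loss and a plain softmax-weighted sum. The function names (`cross_layer_info_nce`, `attention_fuse`), the temperature value, and the assumption that both feature sets are already projected to a common dimension are all hypothetical.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def cross_layer_info_nce(shallow, deep, temperature=0.1):
    """Generic InfoNCE loss (an assumption, not the paper's stated loss).

    shallow, deep: lists of feature vectors, one per sample, assumed to be
    already projected to the same dimension. Sample i's shallow feature is
    pulled toward its own deep feature and pushed away from the deep
    features of the other samples in the batch.
    """
    s = [l2_normalize(v) for v in shallow]
    d = [l2_normalize(v) for v in deep]
    total = 0.0
    for i, s_i in enumerate(s):
        # Cosine similarity of sample i's shallow feature to every deep feature.
        logits = [sum(a * b for a, b in zip(s_i, d_j)) / temperature for d_j in d]
        # Numerically stable log-sum-exp over the batch.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(z - m) for z in logits))
        # Positive pair: same sample, different layer.
        total += log_denom - logits[i]
    return total / len(s)


def attention_fuse(features, scores):
    """Softmax the per-layer scores and return the weighted sum of features.

    In a trained model the scores would be produced by a learned module;
    here they are passed in directly to keep the sketch self-contained.
    """
    m = max(scores)
    exps = [math.exp(z - m) for z in scores]
    denom = sum(exps)
    weights = [e / denom for e in exps]
    dim = len(features[0])
    fused = [sum(w * f[k] for w, f in zip(weights, features)) for k in range(dim)]
    return fused, weights


if __name__ == "__main__":
    shallow = [[1.0, 0.0], [0.0, 1.0]]
    print(cross_layer_info_nce(shallow, [[1.0, 0.0], [0.0, 1.0]]))  # aligned pairs
    print(cross_layer_info_nce(shallow, [[0.0, 1.0], [1.0, 0.0]]))  # swapped pairs
    print(attention_fuse([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0]))
```

In this toy setting the loss is close to zero when each shallow feature matches its own deep feature and large when the pairs are swapped, which is the alignment pressure that cross-layer contrast is meant to supply to the shallow layers.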
Pages: 2514-2529
Page count: 16
Related papers
50 in total
  • [21] Visual-Textual Attribute Learning for Class-Incremental Facial Expression Recognition
    Lv, Yuanling
    Huang, Guangyu
    Yan, Yan
    Xue, Jing-Hao
    Chen, Si
    Wang, Hanzi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8038 - 8051
  • [22] Learning Discriminative Dictionary for Facial Expression Recognition
    Zhang, Shiqing
    Zhao, Xiaoming
    Chuang, Yuelong
    Guo, Wenping
    Chen, Ying
    IETE TECHNICAL REVIEW, 2018, 35 (03) : 275 - 281
  • [23] Dynamic Objectives Learning for Facial Expression Recognition
    Wen, Guihua
    Chang, Tianyuan
    Li, Huihui
    Jiang, Lijun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (11) : 2914 - 2925
  • [24] Facial expression-based emotion recognition across diverse age groups: a multi-scale vision transformer with contrastive learning approach
    Balachandran, G.
    Ranjith, S.
    Chenthil, T. R.
    Jagan, G. C.
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2025, 49 (01)
  • [25] Facial Expression Recognition via Deep Learning
    Zhao, Xiaoming
    Shi, Xugan
    Zhang, Shiqing
    IETE TECHNICAL REVIEW, 2015, 32 (05) : 347 - 355
  • [26] Transfer Model Collaborating Metric Learning and Dictionary Learning for Cross-Domain Facial Expression Recognition
    Ni, Tongguang
    Zhang, Cong
    Gu, Xiaoqing
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2021, 8 (05) : 1213 - 1222
  • [27] Does Hard-Negative Contrastive Learning Improve Facial Emotion Recognition?
    Win, Khin Cho
    Akhtar, Zahid
    Mohan, C. Krishna
    PROCEEDINGS OF THE 2024 THE 7TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, ICMVA 2024, 2024, : 162 - 168
  • [28] Temporal-Contrastive Appearance Network for Facial Expression Recognition
    Li, Zi-Jun
    Liu, Yu-Hung
    Liu, An-Sheng
    Yang, Yu-Huan
    Yeh, Tso-Hsin
    Fu, Li-Chen
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 2359 - 2364
  • [29] Learning informative and discriminative semantic features for robust facial expression recognition
    Tan, Yumei
    Xia, Haiying
    Song, Shuxiang
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
  • [30] Cross-Domain Facial Expression Recognition via Contrastive Warm up and Complexity-Aware Self-Training
    Li, Yingjian
    Huang, Jiaxing
    Lu, Shijian
    Zhang, Zheng
    Lu, Guangming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5438 - 5450