Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition

被引：3

作者：

Xie, Weicheng ^{[1
]}

Peng, Zhibin ^{[1
]}

Shen, Linlin ^{[1
]}

Lu, Wenya ^{[1
]}

Zhang, Yang ^{[1
]}

Song, Siyang ^{[2
]}

机构：

[1] Shenzhen Univ, Shenzhen Inst Artificial Intelligence, Sch Comp Sci & Software Engn, Comp Vis Inst,Guangdong Key Lab Intelligent Inform, Shenzhen 518060, Peoples R China

[2] Univ Cambridge, Dept Comp Sci & Technol, Cambridge CB2 1TN, England

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2024年 / 33卷

关键词：

Semantics; Cross layer design; Face recognition; Self-supervised learning; Representation learning; Faces; Task analysis; Facial expression recognition; contrastive learning; latent semantic alignment; multi-layer attention; NETWORK; ATTENTION;

D O I：

10.1109/TIP.2024.3378459

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Convolutional neural networks (CNNs) have achieved significant improvement for the task of facial expression recognition. However, current training still suffers from the inconsistent learning intensities among different layers, i.e., the feature representations in the shallow layers are not sufficiently learned compared with those in deep layers. To this end, this work proposes a contrastive learning framework to align the feature semantics of shallow and deep layers, followed by an attention module for representing the multi-scale features in the weight-adaptive manner. The proposed algorithm has three main merits. First, the learning intensity, defined as the magnitude of the backpropagation gradient, of the features on the shallow layer is enhanced by cross-layer contrastive learning. Second, the latent semantics in the shallow-layer and deep-layer features are explored and aligned in the contrastive learning, and thus the fine-grained characteristics of expressions can be taken into account for the feature representation learning. Third, by integrating the multi-scale features from multiple layers with an attention module, our algorithm achieved the state-of-the-art performances, i.e. 92.21%, 89.50%, 62.82%, on three in-the-wild expression databases, i.e. RAF-DB, FERPlus, SFEW, and the second best performance, i.e. 65.29% on AffectNet dataset. Our codes will be made publicly available.

引用

页码：2514 / 2529

页数：16

共 50 条

[21] Visual-Textual Attribute Learning for Class-Incremental Facial Expression Recognition
Lv, Yuanling
Huang, Guangyu
Yan, Yan
Xue, Jing-Hao
Chen, Si
Wang, Hanzi
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8038 - 8051
[22] Learning Discriminative Dictionary for Facial Expression Recognition
Zhang, Shiqing
Zhao, Xiaoming
Chuang, Yuelong
Guo, Wenping
Chen, Ying
IETE TECHNICAL REVIEW, 2018, 35 (03) : 275 - 281
[23] Dynamic Objectives Learning for Facial Expression Recognition
Wen, Guihua
Chang, Tianyuan
Li, Huihui
Jiang, Lijun
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (11) : 2914 - 2925
[24] Facial expression-based emotion recognition across diverse age groups: a multi-scale vision transformer with contrastive learning approach
Balachandran, G.
Ranjith, S.
Chenthil, T. R.
Jagan, G. C.
JOURNAL OF COMBINATORIAL OPTIMIZATION, 2025, 49 (01)
[25] Facial Expression Recognition via Deep Learning
Zhao, Xiaoming
Shi, Xugan
Zhang, Shiqing
IETE TECHNICAL REVIEW, 2015, 32 (05) : 347 - 355
[26] Transfer Model Collaborating Metric Learning and Dictionary Learning for Cross-Domain Facial Expression Recognition
Ni, Tongguang
Zhang, Cong
Gu, Xiaoqing
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2021, 8 (05) : 1213 - 1222
[27] Does Hard-Negative Contrastive Learning Improve Facial Emotion Recognition?
Win, Khin Cho
Akhtar, Zahid
Mohan, C. Krishna
PROCEEDINGS OF THE 2024 THE 7TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, ICMVA 2024, 2024, : 162 - 168
[28] Temporal-Contrastive Appearance Network for Facial Expression Recognition
Li, Zi-Jun
Liu, Yu-Hung
Liu, An-Sheng
Yang, Yu-Huan
Yeh, Tso-Hsin
Fu, Li-Chen
2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 2359 - 2364
[29] Learning informative and discriminative semantic features for robust facial expression recognition
Tan, Yumei
Xia, Haiying
Song, Shuxiang
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
[30] Cross-Domain Facial Expression Recognition via Contrastive Warm up and Complexity-Aware Self-Training
Li, Yingjian
Huang, Jiaxing
Lu, Shijian
Zhang, Zheng
Lu, Guangming
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5438 - 5450

← 1 2 3 4 5 →