Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition

被引：3

作者：

Xie, Weicheng ^{[1
]}

Peng, Zhibin ^{[1
]}

Shen, Linlin ^{[1
]}

Lu, Wenya ^{[1
]}

Zhang, Yang ^{[1
]}

Song, Siyang ^{[2
]}

机构：

[1] Shenzhen Univ, Shenzhen Inst Artificial Intelligence, Sch Comp Sci & Software Engn, Comp Vis Inst,Guangdong Key Lab Intelligent Inform, Shenzhen 518060, Peoples R China

[2] Univ Cambridge, Dept Comp Sci & Technol, Cambridge CB2 1TN, England

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2024年 / 33卷

关键词：

Semantics; Cross layer design; Face recognition; Self-supervised learning; Representation learning; Faces; Task analysis; Facial expression recognition; contrastive learning; latent semantic alignment; multi-layer attention; NETWORK; ATTENTION;

D O I：

10.1109/TIP.2024.3378459

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Convolutional neural networks (CNNs) have achieved significant improvement for the task of facial expression recognition. However, current training still suffers from the inconsistent learning intensities among different layers, i.e., the feature representations in the shallow layers are not sufficiently learned compared with those in deep layers. To this end, this work proposes a contrastive learning framework to align the feature semantics of shallow and deep layers, followed by an attention module for representing the multi-scale features in the weight-adaptive manner. The proposed algorithm has three main merits. First, the learning intensity, defined as the magnitude of the backpropagation gradient, of the features on the shallow layer is enhanced by cross-layer contrastive learning. Second, the latent semantics in the shallow-layer and deep-layer features are explored and aligned in the contrastive learning, and thus the fine-grained characteristics of expressions can be taken into account for the feature representation learning. Third, by integrating the multi-scale features from multiple layers with an attention module, our algorithm achieved the state-of-the-art performances, i.e. 92.21%, 89.50%, 62.82%, on three in-the-wild expression databases, i.e. RAF-DB, FERPlus, SFEW, and the second best performance, i.e. 65.29% on AffectNet dataset. Our codes will be made publicly available.

引用

页码：2514 / 2529

页数：16

共 50 条

[41] A Novel Multi-Feature Joint Learning Ensemble Framework for Multi-Label Facial Expression Recognition
Li, Wanzhao
Luo, Mingyuan
Zhang, Peng
Huang, Wei
IEEE ACCESS, 2021, 9 : 119766 - 119777
[42] LAUNet: A Latent Action Units Network for Facial Expression Recognition
Zhang, Junlin
Hirota, Kaoru
Dai, Yaping
Yin, Sijie
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 2513 - 2518
[43] Machine Learning based Efficient Facial Expression Recognition Algorithm
Akram, Noreen
Butt, Rizwan Aslam
Akram, Ambreen
Zaidi, Syed Rehan Ali
2022 GLOBAL CONFERENCE ON WIRELESS AND OPTICAL TECHNOLOGIES (GCWOT), 2022, : 51 - 58
[44] Local and correlation attention learning for subtle facial expression recognition
Wang, Shaocong
Yuan, Yuan
Zheng, Xiangtao
Lu, Xiaoqiang
NEUROCOMPUTING, 2021, 453 : 742 - 753
[45] Cross-modal contrastive learning for multimodal sentiment recognition
Yang, Shanliang
Cui, Lichao
Wang, Lei
Wang, Tao
APPLIED INTELLIGENCE, 2024, 54 (05) : 4260 - 4276
[46] Learning Informative and Discriminative Features for Facial Expression Recognition in the Wild
Li, Yingjian
Lu, Yao
Chen, Bingzhi
Zhang, Zheng
Li, Jinxing
Lu, Guangming
Zhang, David
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 3178 - 3189
[47] MSL-FER: MIRRORED SELF-SUPERVISED LEARNING FOR FACIAL EXPRESSION RECOGNITION
Pan, Xiangshuai
Liu, Weifeng
Wang, Yanjiang
Lu, Xiaoping
Liu, Baodi
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1601 - 1605
[48] Facial expression recognition based on active region of interest using deep learning and parallelism
Hossain, Mohammad Alamgir
Assiri, Basem
PEERJ COMPUTER SCIENCE, 2022, 8
[49] Cross-modal contrastive learning for multimodal sentiment recognition
Shanliang Yang
Lichao Cui
Lei Wang
Tao Wang
Applied Intelligence, 2024, 54 : 4260 - 4276
[50] Learning from More: Combating Uncertainty Cross-multidomain for Facial Expression Recognition
Liu, Hanwei
Cai, Huiling
Lin, Qingcheng
Li, Xuefeng
Xiao, Hui
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5889 - 5898

← 1 2 3 4 5 →