Dual subspace manifold learning based on GCN for intensity-invariant facial expression recognition

被引：11

作者：

Chen, Jingying ^{[1
,2
]}

Shi, Jinxin ^{[1
,2
,3
]}

Xu, Ruyi ^{[1
,2
]}

机构：

[1] Cent China Normal Univ, Fac Artificial Intelligence Educ, Wuhan 430079, Peoples R China

[2] Cent China Normal Univ, Natl Engn Lab Educ Big Data, Wuhan 430079, Peoples R China

[3] East China Normal Univ, Sch Comp Sci & Technol, Shanghai 200062, Peoples R China

来源：

PATTERN RECOGNITION | 2024年 / 148卷

关键词：

Graph convolutional network; Semi-supervised learning; Intensity-invariant representation; Manifold learning; Facial expression recognition;

D O I：

10.1016/j.patcog.2023.110157

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Facial expression recognition (FER) is one of the most important computer vision tasks for understanding human inner emotions. However, the poor generation ability of the FER model limits its applicability due to tremendous intraclass variation. Especially for expressions of varying intensities, the appearance differences among weak expressions are subtle, which makes FER tasks challenging. In response to these issues, this paper presents a dual subspace manifold learning method based on a graph convolutional network (GCN) for intensity-invariant FER tasks. Our method treats the target task as a node classification problem and learns the manifold representation using two subspace analysis methods: locality preserving projection (LPP) and peak piloted locality preserving projection (PLPP). Inspired by the classic LPP, which maintains local similarity among data, this paper introduces a novel PLPP that maintains the locality between peak expressions and non-peak expressions to enhance the representation of weak expressions. This paper also reports two subspace fusion methods, one based on a weighted adjacency matrix and another on a self-attention mechanism, that combine the LPP and PLPP to further improve FER performance. The second method achieves a recognition accuracy of 93.83% on the CK+, 74.86% on the Oulu-CASIA and 75.37% on the MMI for weak expressions, outperforming state-of-the-art methods.

引用

页数：16

共 40 条

[1]

Alsubaiee S., 2015, P 2 INT ACM WORKSH M, P1, DOI [DOI 10.1145/2786006.2786007, 10.1049/cp.2015.1766]

[2]

[Anonymous], 2005, International Journal of Information Technology

[3]

Bartlett MS, 2005, PROC CVPR IEEE, P568

[4] Orthogonal channel attention-based multi-task learning for multi-view facial expression recognition [J].

Chen, Jingying ;

Yang, Lei ;

Tan, Lei ;

Xu, Ruyi .

PATTERN RECOGNITION, 2022, 129

[5] Toward Children's Empathy Ability Analysis: Joint Facial Expression Recognition and Intensity Estimation Using Label Distribution Learning [J].

Chen, Jingying ;

Guo, Chen ;

Xu, Ruyi ;

Zhang, Kun ;

Yang, Zongkai ;

Liu, Honghai .

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (01) :16-25

[6] Natural facial expression recognition using differential-AAM and manifold learning [J].

Cheon, Yeongjae ;

Kim, Daijin .

PATTERN RECOGNITION, 2009, 42 (07) :1340-1350

[7] FaceNet2ExpNet: Regularizing a Deep Face Recognition Net for Expression Recognition [J].

Ding, Hui ;

Zhou, Shaohua Kevin ;

Chellappa, Rama .

2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, :118-126

[8]

Dosovitskiy A, 2021, INT C LEARN REPR ICL

[9] Learning head pose-insensitive and discriminative deep features for smile detection [J].

Gan, Yanling ;

Chen, Jingying ;

Xu, Luhui .

JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (05)

[10]

He XF, 2004, ADV NEUR IN, V16, P153

← 1 2 3 4 →