Learning to disentangle emotion factors for facial expression recognition in the wild

被引：19

作者：

Zhu, Qing ^{[1
]}

Gao, Lijian ^{[1
]}

Song, Heping ^{[1
]}

Mao, Qirong ^{[1
,2
]}

机构：

[1] Jiangsu Univ, Dept Comp Sci & Commun Engn, Zhenjiang 212013, Jiangsu, Peoples R China

[2] Jiangsu Engn Res Ctr Big Data Ubiquitous Percept, Zhenjiang, Jiangsu, Peoples R China

来源：

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS | 2021年 / 36卷 / 06期

基金：

中国国家自然科学基金;

关键词：

attention mechanism; disentangled representation learning; facial expression recognition; feature learning;

D O I：

10.1002/int.22391

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Facial expression recognition (FER) in the wild is a very challenging problem due to different expressions under complex scenario (e.g., large head pose, illumination variation, occlusions, etc.), leading to suboptimal FER performance. Accuracy in FER heavily relies on discovering superior discriminative, emotion-related features. In this paper, we propose an end-to-end module to disentangle latent emotion discriminative factors from the complex factors variables for FER to obtain salient emotion features. The training of proposed method contains two stages. First of all, emotion samples are used to obtain the latent representation using a variational auto-encoder with reconstruction penalization. Furthermore, the latent representation as the input is thrown into a disentangling layer to learn a set of discriminative emotion factors through the attention mechanism (e.g., a Squeeze-and-Excitation block) that encourages to separate emotion-related factors and nonaffective factors. Experimental results on public benchmark databases (RAF-DB and FER2013) show that our approach has remarkable performance in complex scenes than current state-of-the-art methods.

引用

页码：2511 / 2527

页数：17

共 63 条

[1]

[Anonymous], 2014, NEURAL INFORM PROCES

[2]

Barsoum E., 2016, Proceedings of the 18th ACM International Conference on Multimodal Interaction, p279

[3] Representation Learning: A Review and New Perspectives [J].

Bengio, Yoshua ;

Courville, Aaron ;

Vincent, Pascal .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828

[4]

Breuer R., 2017, ARXIV PREPRINT ARXIV

[5]

Burkert P., 2015, CoRR

[6]

Cai J., 2018, 13 IEEE INT C AUTOMA

[7] Toward Real-World Single Image Super-Resolution: A New Benchmark and A New Model [J].

Cai, Jianrui ;

Zeng, Hui ;

Yong, Hongwei ;

Cao, Zisheng ;

Zhang, Lei .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3086-3095

[8]

Chen JZ, 2016, PROCEEDINGS OF 2016 12TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), P551, DOI [10.1109/CIS.2016.133, 10.1109/CIS.2016.0134]

[9]

Chen JW., 2017, 3 INT C LEARNING REP, P147

[10]

Chen X, 2016, ADV NEUR IN, V29

← 1 2 3 4 5 6 7 →