Multimodal Fusion-based Swin Transformer for Facial Recognition Micro-Expression Recognition

被引：9

作者：

Zhao, Xinhua ^{[1
]}

Lv, Yongjia ^{[1
]}

Huang, Zheng ^{[1
]}

机构：

[1] Harbin Engn Univ, Coll Automat, Harbin 150001, Peoples R China

来源：

PROCEEDINGS OF 2022 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2022) | 2022年

关键词：

Micro-expression; Apex frame; Difference; Vision Transformer;

D O I：

10.1109/ICMA54519.2022.9856162

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Micro-expression recognition is the domain of vigorous computational vision research, which up against significant challenges stems from micro-expressions being spontaneous, brief and faint facial muscle movements. The paper presents a very novel method of Multimodal fusion micro-expression recognition using a visual transformer, which is not commonly used for micro-expression recognition. As compared to convolutional neural networks, transformers are widely thought to require more data. Then, we choose similar expression datasets to pre-training the model, while increasing the number of datasets.The results of the validation and evaluation of the model conducted with the CASME II, MMEW and SMIC datasets yielded state-of-the-art performance in terms of average accuracy of 81.50%, 82.97%, and 79.99%, respectively.When using Score-CAM to obtain the facial expression activation heat map, it is obvious that our model matches well with the expression action units. The proposed model obtains more promising recognition results than many other recognition methods.

引用

页码：780 / 785

页数：6

共 20 条

[1]

[Anonymous], 1966, Methods of research in psychotherapy, DOI [DOI 10.1007/978-1-4684-6045-214, DOI 10.1007/978-1-4684-6045-2_14]

[2] PERFORMANCE OF OPTICAL-FLOW TECHNIQUES [J].

BARRON, JL ;

FLEET, DJ ;

BEAUCHEMIN, SS .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 1994, 12 (01) :43-77

[3] Facial Micro-Expression Recognition Using Two-Dimensional Landmark Feature Maps [J].

Choi, Dong Yoon ;

Song, Byung Cheol .

IEEE ACCESS, 2020, 8 :121549-121563

[4]

Darwin C, 2015, EXPRESS EMOT MAN, P140

[5]

Dosovitskiy A, 2020, ARXIV

[6]

Goyal A, 2021, Arxiv, DOI arXiv:2011.15091

[7] A Survey on Vision Transformer [J].

Han, Kai ;

Wang, Yunhe ;

Chen, Hanting ;

Chen, Xinghao ;

Guo, Jianyuan ;

Liu, Zhenhua ;

Tang, Yehui ;

Xiao, An ;

Xu, Chunjing ;

Xu, Yixing ;

Yang, Zhaohui ;

Zhang, Yiman ;

Tao, Dacheng .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) :87-110

[8]

Khor HQ, 2019, IEEE IMAGE PROC, P36, DOI [10.1109/icip.2019.8802965, 10.1109/ICIP.2019.8802965]

[9]

King DE, 2009, J MACH LEARN RES, V10, P1755

[10]

Lei Ling, 2020, P 28 ACM INT C MULTI

← 1 2 →