SL-Swin: A Transformer-Based Deep Learning Approach for Macro- and Micro-Expression Spotting on Small-Size Expression Datasets

被引：8

作者：

He, Erheng ^{[1
]}

Chen, Qianru ^{[2
]}

Zhong, Qinghua ^{[1
,2
]}

机构：

[1] South China Normal Univ, Sch Phys & Telecommun Engn, Guangzhou 510006, Peoples R China

[2] South China Normal Univ, Sch Elect & Informat Engn, Foshan 528225, Peoples R China

来源：

ELECTRONICS | 2023年 / 12卷 / 12期

关键词：

macro- and micro-expression spotting; image processing; computer vision; artificial intelligence; deep learning; swin transformer; shifted patch tokenization; locality self-attention;

D O I：

10.3390/electronics12122656

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In recent years, the analysis of macro- and micro-expression has drawn the attention of researchers. These expressions provide visual cues to an individual's emotions, which can be used in a broad range of potential applications such as lie detection and policing. In this paper, we address the challenge of spotting facial macro- and micro-expression from videos and present compelling results by using a deep learning approach to analyze the optical flow features. Unlike other deep learning approaches that are mainly based on Convolutional Neural Networks (CNNs), we propose a Transformer-based deep learning approach that predicts a score indicating the probability of a frame being within an expression interval. In contrast to other Transformer-based models that achieve high performance by being pre-trained on large datasets, our deep learning model, called SL-Swin, which incorporates Shifted Patch Tokenization and Locality Self-Attention into the backbone Swin Transformer network, effectively spots macro- and micro-expressions by being trained from scratch on small-size expression datasets. Our evaluation outcomes surpass the MEGC 2022 spotting baseline result, obtaining an overall F1-score of 0.1366. Additionally, our approach performs well on the MEGC 2021 spotting task, with an overall F1-score of 0.1824 and 0.1357 on the CAS(ME)2 and SAMM Long Videos, respectively. The code is publicly available on GitHub.

引用

页数：18

共 41 条

[1] Video-Based Facial Micro-Expression Analysis: A Survey of Datasets, Features and Algorithms [J].

Ben, Xianye ;

Ren, Yi ;

Zhang, Junping ;

Wang, Su-Jing ;

Kpalma, Kidiyo ;

Meng, Weixiao ;

Liu, Yong-Jin .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) :5826-5846

[2] Objective Classes for Micro-Facial Expression Recognition [J].

Davison, Adrian K. ;

Merghani, Walied ;

Yap, Moi Hoon .

JOURNAL OF IMAGING, 2018, 4 (10)

[3] SAMM: A Spontaneous Micro-Facial Movement Dataset [J].

Davison, Adrian K. ;

Lansley, Cliff ;

Costen, Nicholas ;

Tan, Kevin ;

Yap, Moi Hoon .

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2018, 9 (01) :116-129

[4] Micro-Facial Movement Detection Using Individualised Baselines and Histogram-Based Descriptors [J].

Davison, Adrian K. ;

Yap, Moi Hoon ;

Lansley, Cliff .

2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, :1864-1869

[5]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[6]

Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171

[7]

Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929

[8] Micro-Expression Spotting using the Riesz Pyramid [J].

Duque, Carlos Arango ;

Alata, Olivier ;

Emonet, Remi ;

Legrand, Anne-Claire ;

Konik, Hubert .

2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, :66-74

[9] CAS(ME)2: A Database for Spontaneous Macro-Expression and Micro-Expression Spotting and Recognition [J].

Qu, Fangbing ;

Wang, Su-Jing ;

Yan, Wen-Jing ;

Li, He ;

Wu, Shuhang ;

Fu, Xiaolan .

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2018, 9 (04) :424-436

[10]

Gen Bing Liong, 2022, FME '22: Proceedings of the 2nd Workshop on Facial Micro-Expression: Advanced Techniques for Multi-Modal Facial Expression Analysis, P3, DOI 10.1145/3552465.3555040

← 1 2 3 4 5 →