SL-Swin: A Transformer-Based Deep Learning Approach for Macro- and Micro-Expression Spotting on Small-Size Expression Datasets

被引:8
作者
He, Erheng [1 ]
Chen, Qianru [2 ]
Zhong, Qinghua [1 ,2 ]
机构
[1] South China Normal Univ, Sch Phys & Telecommun Engn, Guangzhou 510006, Peoples R China
[2] South China Normal Univ, Sch Elect & Informat Engn, Foshan 528225, Peoples R China
关键词
macro- and micro-expression spotting; image processing; computer vision; artificial intelligence; deep learning; swin transformer; shifted patch tokenization; locality self-attention;
D O I
10.3390/electronics12122656
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the analysis of macro- and micro-expression has drawn the attention of researchers. These expressions provide visual cues to an individual's emotions, which can be used in a broad range of potential applications such as lie detection and policing. In this paper, we address the challenge of spotting facial macro- and micro-expression from videos and present compelling results by using a deep learning approach to analyze the optical flow features. Unlike other deep learning approaches that are mainly based on Convolutional Neural Networks (CNNs), we propose a Transformer-based deep learning approach that predicts a score indicating the probability of a frame being within an expression interval. In contrast to other Transformer-based models that achieve high performance by being pre-trained on large datasets, our deep learning model, called SL-Swin, which incorporates Shifted Patch Tokenization and Locality Self-Attention into the backbone Swin Transformer network, effectively spots macro- and micro-expressions by being trained from scratch on small-size expression datasets. Our evaluation outcomes surpass the MEGC 2022 spotting baseline result, obtaining an overall F1-score of 0.1366. Additionally, our approach performs well on the MEGC 2021 spotting task, with an overall F1-score of 0.1824 and 0.1357 on the CAS(ME)2 and SAMM Long Videos, respectively. The code is publicly available on GitHub.
引用
收藏
页数:18
相关论文
共 41 条
[1]   Video-Based Facial Micro-Expression Analysis: A Survey of Datasets, Features and Algorithms [J].
Ben, Xianye ;
Ren, Yi ;
Zhang, Junping ;
Wang, Su-Jing ;
Kpalma, Kidiyo ;
Meng, Weixiao ;
Liu, Yong-Jin .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) :5826-5846
[2]   Objective Classes for Micro-Facial Expression Recognition [J].
Davison, Adrian K. ;
Merghani, Walied ;
Yap, Moi Hoon .
JOURNAL OF IMAGING, 2018, 4 (10)
[3]   SAMM: A Spontaneous Micro-Facial Movement Dataset [J].
Davison, Adrian K. ;
Lansley, Cliff ;
Costen, Nicholas ;
Tan, Kevin ;
Yap, Moi Hoon .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2018, 9 (01) :116-129
[4]   Micro-Facial Movement Detection Using Individualised Baselines and Histogram-Based Descriptors [J].
Davison, Adrian K. ;
Yap, Moi Hoon ;
Lansley, Cliff .
2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, :1864-1869
[5]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[6]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[7]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[8]   Micro-Expression Spotting using the Riesz Pyramid [J].
Duque, Carlos Arango ;
Alata, Olivier ;
Emonet, Remi ;
Legrand, Anne-Claire ;
Konik, Hubert .
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, :66-74
[9]   CAS(ME)2: A Database for Spontaneous Macro-Expression and Micro-Expression Spotting and Recognition [J].
Qu, Fangbing ;
Wang, Su-Jing ;
Yan, Wen-Jing ;
Li, He ;
Wu, Shuhang ;
Fu, Xiaolan .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2018, 9 (04) :424-436
[10]  
Gen Bing Liong, 2022, FME '22: Proceedings of the 2nd Workshop on Facial Micro-Expression: Advanced Techniques for Multi-Modal Facial Expression Analysis, P3, DOI 10.1145/3552465.3555040