Transformer embedded spectral-based graph network for facial expression recognition

被引:4
|
作者
Jin, Xing [1 ]
Song, Xulin [2 ]
Wu, Xiyin [3 ]
Yan, Wenzhu [4 ]
机构
[1] Nanjing Forestry Univ, Coll Informat Sci & Technol, 159 Longpan Rd, Nanjing 210037, Jiangsu, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Sch Internet Things, 9 Wenyuan Rd, Nanjing 210023, Jiangsu, Peoples R China
[3] Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Guizhou, Peoples R China
[4] Nanjing Normal Univ, Sch Artificial Intelligence, Sch Comp & Elect Informat, 1 Wenyuan Rd, Nanjing 210023, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph convolution; Facial expression; Transformer encoder; Action units;
D O I
10.1007/s13042-023-02016-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep graph convolution networks which exploit relations among facial muscle movements for facial expression recognition (FER) have achieved great success. Due to the limited receptive field, existing graph convolution operations are difficult to model long-range muscle movement relations which plays a crucial role in FER. To alleviate this issue, we introduce the transformer encoder into graph convolution networks, in which the vision transformer enables all facial muscle movements to interact in the global receptive field and model more complex relations. Specifically, we construct facial graph data by cropping regions of interest (ROIs) which are associated with facial action units, and each ROI is represented by the representation of hidden layers from deep auto-encoder. To effectively extract features from the constructed facial graph data, we propose a novel transformer embedded spectral-based graph convolution network (TESGCN), in which the transformer encoder is exploited to interact with complex relations among facial RIOs for FER. Compared to vanilla graph convolution networks, we empirically show the superiority of the proposed model by conducting extensive experiments across four facial expression datasets. Moreover, our proposed TESGCN only has 80K parameters and 0.41MB model size, and achieves comparable results compared to existing lightweight networks.
引用
收藏
页码:2063 / 2077
页数:15
相关论文
共 50 条
  • [1] Spectral-Based Directed Graph Network for Malware Detection
    Zhang, Zikai
    Li, Yidong
    Dong, Hairong
    Gao, Honghao
    Jin, Yi
    Wang, Wei
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2021, 8 (02): : 957 - 970
  • [2] A graph spectral-based scoring scheme for network comparison
    Gadiyaram V.
    Ghosh S.
    Vishveshwara S.
    Vishveshwara, Saraswathi (sv@mbu.iisc.ernet.in), 2017, Oxford University Press (05) : 219 - 244
  • [3] FERGCN: facial expression recognition based on graph convolution network
    Lei Liao
    Yu Zhu
    Bingbing Zheng
    Xiaoben Jiang
    Jiajun Lin
    Machine Vision and Applications, 2022, 33
  • [4] FERGCN: facial expression recognition based on graph convolution network
    Liao, Lei
    Zhu, Yu
    Zheng, Bingbing
    Jiang, Xiaoben
    Lin, Jiajun
    MACHINE VISION AND APPLICATIONS, 2022, 33 (03)
  • [5] Facial expression recognition using the spectral graph wavelet
    Meena, Hemant Kumar
    Sharma, Kamalesh Kumar
    Joshi, Shiv Dutt
    IET SIGNAL PROCESSING, 2019, 13 (02) : 224 - 229
  • [6] Multi-geometry embedded transformer for facial expression recognition in videos
    Chen, Dongliang
    Wen, Guihua
    Li, Huihui
    Yang, Pei
    Chen, Chuyun
    Wang, Bao
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [7] Learning Dynamic Relationships for Facial Expression Recognition Based on Graph Convolutional Network
    Jin, Xing
    Lai, Zhihui
    Jin, Zhong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) : 7143 - 7155
  • [8] Attentional visual graph neural network based facial expression recognition method
    Dong, Wenmin
    Zheng, Xiangwei
    Zhang, Lifeng
    Zhang, Yuang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (12) : 8693 - 8705
  • [9] Landmark-Based Adaptive Graph Convolutional Network for Facial Expression Recognition
    Zhao, Daqi
    Wang, Jingwen
    Li, Haoming
    Wang, Deqiang
    IEEE ACCESS, 2024, 12 : 136088 - 136102
  • [10] STATISTICAL, SPECTRAL AND GRAPH REPRESENTATIONS FOR VIDEO-BASED FACIAL EXPRESSION RECOGNITION IN CHILDREN
    Abbasi, Nida Itrat
    Song, Siyang
    Gunes, Hatice
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1725 - 1729