FER-PCVT: Facial Expression Recognition with Patch-Convolutional Vision Transformer for Stroke Patients

Cited by: 7
Authors
Fan, Yiming [1]
Wang, Hewei [2]
Zhu, Xiaoyu [1]
Cao, Xiangming [3]
Yi, Chuanjian [4]
Chen, Yao [5]
Jia, Jie [2]
Lu, Xiaofeng [1, 6]
Affiliations
[1] Shanghai Univ, Sch Commun & Informat Engn, Shanghai 200444, Peoples R China
[2] Fudan Univ, Huashan Hosp, Dept Rehabil, Shanghai 200040, Peoples R China
[3] Nantong Univ, Dept Oncol, Jiangyin Peoples Hosp, Wuxi 214400, Peoples R China
[4] Qingdao Univ, Affiliated Hosp, Dept Rehabil, Qingdao 266000, Peoples R China
[5] Shanghai Third Rehabil Hosp, Dept Rehabil, Shanghai 200436, Peoples R China
[6] Shanghai Univ, Wenzhou Inst, Wenzhou 325000, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
facial expression recognition (FER); vision transformer (ViT); convolutional neural networks (CNNs); stroke; rehabilitation; SYSTEM
DOI
10.3390/brainsci12121626
Chinese Library Classification number
Q189 [Neuroscience]
Discipline classification code
071006
Abstract
Early rehabilitation with the right intensity contributes to the physical recovery of stroke survivors. In clinical practice, physicians determine whether the training intensity is suitable for rehabilitation based on patients' narratives, training scores, and evaluation scales, which puts tremendous pressure on medical resources. In this study, a lightweight facial expression recognition algorithm is proposed to assess stroke patients' training motivation automatically. First, the properties of convolution are introduced into the Vision Transformer's structure, allowing the model to extract both local and global features of facial expressions. Second, the pyramid-shaped feature output mode of Convolutional Neural Networks is introduced to significantly reduce the model's parameter count and computational cost. Moreover, a classifier that can better classify the facial expressions of stroke patients is designed to improve performance further. We verified the proposed algorithm on the Real-world Affective Faces Database (RAF-DB), the Face Expression Recognition Plus Dataset (FER+), and a private dataset for stroke patients. Experiments show that the backbone network of the proposed algorithm achieves better performance than Pyramid Vision Transformer (PvT) and Convolutional Vision Transformer (CvT) with fewer parameters and fewer floating-point operations (FLOPs). In addition, the algorithm reaches an accuracy of 89.44% on the RAF-DB dataset, which is higher than other recent studies. In particular, it obtains an accuracy of 99.81% on the private dataset with only 4.10M parameters.
Pages: 20