Efficient Net-XGBoost: An Implementation for Facial Emotion Recognition Using Transfer Learning

被引:12
作者
Punuri, Sudheer Babu [1 ]
Kuanar, Sanjay Kumar [1 ]
Kolhar, Manjur [2 ]
Mishra, Tusar Kanti [3 ]
Alameen, Abdalla [4 ]
Mohapatra, Hitesh [5 ]
Mishra, Soumya Ranjan [5 ]
机构
[1] GIET Univ, CSE Dept, Gunupur 765022, Orissa, India
[2] Prince Sattam Bin Abdulaziz Univ, Coll Arts & Sci, Dept Comp Sci, Al Kharj 16278, Saudi Arabia
[3] Vellore Inst Technol, Sch Comp Sci & Engn, Vellore 632014, Tamil Nadu, India
[4] Prince Sattam Bin Abdul Aziz Univ, Comp Sci Dept, Al Kharj 16278, Saudi Arabia
[5] KIIT Deemed Be Univ, Sch Comp Engn, Bhubaneswar 751024, Orissa, India
关键词
facial emotion recognition; transfer learning; deep learning; EfficientNet; XGBoost; CONVOLUTIONAL NEURAL-NETWORKS; EXPRESSION RECOGNITION; DEEP;
D O I
10.3390/math11030776
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Researchers are interested in Facial Emotion Recognition (FER) because it could be useful in many ways and has promising applications. The main task of FER is to identify and recognize the original facial expressions of users from digital inputs. Feature extraction and emotion recognition make up the majority of the traditional FER. Deep Neural Networks, specifically Convolutional Neural Network (CNN), are popular and highly used in FER due to their inherent image feature extraction process. This work presents a novel method dubbed as EfficientNet-XGBoost that is based on Transfer Learning (TL) technique. EfficientNet-XGBoost is basically a cascading of the EfficientNet and the XGBoost techniques along with certain enhancements by experimentation that reflects the novelty of the work. To ensure faster learning of the network and to overcome the vanishing gradient problem, our model incorporates fully connected layers of global average pooling, dropout and dense. EfficientNet is fine-tuned by replacing the upper dense layer(s) and cascading the XGBoost classifier making it suitable for FER. Feature map visualization is carried out that reveals the reduction in the size of feature vectors. The proposed method is well-validated on benchmark datasets such as CK+, KDEF, JAFFE, and FER2013. To overcome the issue of data imbalance, in some of the datasets namely CK+ and FER2013, we augmented data artificially through geometric transformation techniques. The proposed method is implemented individually on these datasets and corresponding results are recorded for performance analysis. The performance is computed with the help of several metrics like precision, recall and F1 measure. Comparative analysis with competent schemes are carried out on the same sample data sets separately. Irrespective of the nature of the datasets, the proposed scheme outperforms the rest with overall rates of accuracy being 100%, 98% and 98% for the first three datasets respectively. However, for the FER2013 datasets, efficiency is less promisingly observed in support of the proposed work.
引用
收藏
页数:24
相关论文
共 35 条
  • [1] Facial Emotion Recognition Using Transfer Learning in the Deep CNN
    Akhand, M. A. H.
    Roy, Shuvendu
    Siddique, Nazmul
    Kamal, Md Abdus Samad
    Shimamura, Tetsuya
    [J]. ELECTRONICS, 2021, 10 (09)
  • [2] Alshamsi H, 2017, 2017 8TH IEEE ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS AND MOBILE COMMUNICATION CONFERENCE (IEMCON), P384, DOI 10.1109/IEMCON.2017.8117150
  • [3] Aouayeb M., 2021, arXiv
  • [4] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [5] Facial expression recognition based on local binary patterns
    Feng X.
    Pietikäinen M.
    Hadid A.
    [J]. Pattern Recogn. Image Anal., 2007, 4 (592-598): : 592 - 598
  • [6] Multiple Attention Network for Facial Expression Recognition
    Gan, Yanling
    Chen, Jingying
    Yang, Zongkai
    Xu, Luhui
    [J]. IEEE ACCESS, 2020, 8 : 7383 - 7393
  • [7] Goodfellow Ian J., 2013, Neural Information Processing. 20th International Conference, ICONIP 2013. Proceedings: LNCS 8228, P117, DOI 10.1007/978-3-642-42051-1_16
  • [8] Goyani MaheshM., 2018, International Journal of Next-Generation Computing, P131, DOI 10.17485/ijst/2017/v10i9/108944
  • [9] Automatic Facial Expression Recognition Using Features of Salient Facial Patches
    Happy, S. L.
    Routray, Aurobinda
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2015, 6 (01) : 1 - 12
  • [10] Robust Facial Expression Recognition Based on Local Directional Pattern
    Jabid, Taskeed
    Kabir, Md. Hasanul
    Chae, Oksam
    [J]. ETRI JOURNAL, 2010, 32 (05) : 784 - 794