Few-Shot Learning with a Novel Voronoi Tessellation-Based Image Augmentation Method for Facial Palsy Detection

被引:19
作者
Abayomi-Alli, Olusola Oluwakemi [1 ]
Damasevicius, Robertas [1 ]
Maskeliunas, Rytis [2 ,3 ]
Misra, Sanjay [4 ,5 ]
机构
[1] Kaunas Univ Technol, Dept Software Engn, LT-51368 Kaunas, Lithuania
[2] Vytautas Magnus Univ, Dept Appl Informat, LT-44404 Kaunas, Lithuania
[3] Silesian Tech Univ, Fac Appl Math, PL-44100 Gliwice, Poland
[4] Covenant Univ, Dept Elect & Informat Engn, Ota 112212, Ogun State, Nigeria
[5] Atilim Univ, Dept Comp Engn, TR-06830 Ankara, Turkey
关键词
data augmentation; small data; Voronoi tessellation; few-shot learning; deep learning; face recognition; face palsy; NERVE PARALYSIS; RECOGNITION; FEATURES; NETWORK;
D O I
10.3390/electronics10080978
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Face palsy has adverse effects on the appearance of a person and has negative social and functional consequences on the patient. Deep learning methods can improve face palsy detection rate, but their efficiency is limited by insufficient data, class imbalance, and high misclassification rate. To alleviate the lack of data and improve the performance of deep learning models for palsy face detection, data augmentation methods can be used. In this paper, we propose a novel Voronoi decomposition-based random region erasing (VDRRE) image augmentation method consisting of partitioning images into randomly defined Voronoi cells as an alternative to rectangular based random erasing method. The proposed method augments the image dataset with new images, which are used to train the deep neural network. We achieved an accuracy of 99.34% using two-shot learning with VDRRE augmentation on palsy faces from Youtube Face Palsy (YFP) dataset, while normal faces are taken from Caltech Face Database. Our model shows an improvement over state-of-the-art methods in the detection of facial palsy from a small dataset of face images.
引用
收藏
页数:18
相关论文
共 60 条
[1]   Helping the Visually Impaired See via Image Multi-labeling Based on SqueezeNet CNN [J].
Alhichri, Haikel ;
Bazi, Yakoub ;
Alajlan, Naif ;
Jdira, Bilel Bin .
APPLIED SCIENCES-BASEL, 2019, 9 (21)
[2]  
[Anonymous], 1999, CALTECH FACE DATABAS
[3]   Clinician-Graded Electronic Facial Paralysis Assessment: The eFACE [J].
Banks, Caroline A. ;
Bhama, Prabhat K. ;
Park, Jong ;
Hadlock, Charles R. ;
Hadlock, Tessa A. .
PLASTIC AND RECONSTRUCTIVE SURGERY, 2015, 136 (02) :223E-230E
[4]  
Bochkovskiy A., 2020, YOLOv4: Optimal Speed and Accuracy of Object Detection
[5]   Albumentations: Fast and Flexible Image Augmentations [J].
Buslaev, Alexander ;
Iglovikov, Vladimir I. ;
Khvedchenya, Eugene ;
Parinov, Alex ;
Druzhinin, Mikhail ;
Kalinin, Alexandr A. .
INFORMATION, 2020, 11 (02)
[6]  
Chen P., 2020, ARXIV
[7]  
DeVries Terrance, 2017, Improved regulariza
[8]   Centroidal Voronoi tessellations: Applications and algorithms [J].
Du, Q ;
Faber, V ;
Gunzburger, M .
SIAM REVIEW, 1999, 41 (04) :637-676
[9]   IMPROVING MODEL SELECTION BY NONCONVERGENT METHODS [J].
FINNOFF, W ;
HERGERT, F ;
ZIMMERMANN, HG .
NEURAL NETWORKS, 1993, 6 (06) :771-783
[10]  
Forrest N. Iandola, 2016, ARXIV