Two-Stage Hybrid Data Classifiers Based on SVM and kNN Algorithms

被引:25
作者
Demidova, Liliya A. [1 ]
机构
[1] MIREA Russian Technol Univ, Inst Informat Technol, Fed State Budget Educ Inst Higher Educ, 78 Vernadsky Avenye, Moscow 119454, Russia
来源
SYMMETRY-BASEL | 2021年 / 13卷 / 04期
关键词
binary SVM classifier; one-class SVM classifier; kNN classifier; hybrid classifier; class imbalance problem; SUPPORT VECTOR MACHINES; K-NEAREST NEIGHBORS; DIFFERENTIAL EVOLUTION; OPTIMIZATION; BIG; FRAMEWORK; DEEP;
D O I
10.3390/sym13040615
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The paper considers a solution to the problem of developing two-stage hybrid SVM-kNN classifiers with the aim to increase the data classification quality by refining the classification decisions near the class boundary defined by the SVM classifier. In the first stage, the SVM classifier with default parameters values is developed. Here, the training dataset is designed on the basis of the initial dataset. When developing the SVM classifier, a binary SVM algorithm or one-class SVM algorithm is used. Based on the results of the training of the SVM classifier, two variants of the training dataset are formed for the development of the kNN classifier: a variant that uses all objects from the original training dataset located inside the strip dividing the classes, and a variant that uses only those objects from the initial training dataset that are located inside the area containing all misclassified objects from the class dividing strip. In the second stage, the kNN classifier is developed using the new training dataset above-mentioned. The values of the parameters of the kNN classifier are determined during training to maximize the data classification quality. The data classification quality using the two-stage hybrid SVM-kNN classifier was assessed using various indicators on the test dataset. In the case of the improvement of the quality of classification near the class boundary defined by the SVM classifier using the kNN classifier, the two-stage hybrid SVM-kNN classifier is recommended for further use. The experimental results approve the feasibility of using two-stage hybrid SVM-kNN classifiers in the data classification problem. The experimental results obtained with the application of various datasets confirm the feasibility of using two-stage hybrid SVM-kNN classifiers in the data classification problem.
引用
收藏
页数:32
相关论文
共 69 条
[1]  
Alashwal H., 2006, Int. J. Biomed. Sci, V1, P120
[2]  
Anfyorov M.A, 2020, Russian Technological Journal, V7, P134, DOI [10.32362/2500-316X-2019-7-6-134-150, DOI 10.32362/2500-316X-2019-7-6-134-150]
[3]  
[Anonymous], 2015, INT J BIOSCI BIOTECH, DOI DOI 10.14257/IJBSBT.2015.7.1.05
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[6]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[7]   On Understanding Big Data Impacts in Remotely Sensed Image Classification Using Support Vector Machine Methods [J].
Cavallaro, Gabriele ;
Riedel, Morris ;
Richerzhagen, Matthias ;
Benediktsson, Jon Atli ;
Plaza, Antonio .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2015, 8 (10) :4634-4646
[8]   Choosing multiple parameters for support vector machines [J].
Chapelle, O ;
Vapnik, V ;
Bousquet, O ;
Mukherjee, S .
MACHINE LEARNING, 2002, 46 (1-3) :131-159
[9]   Deep, Big, Simple Neural Nets for Handwritten Digit Recognition [J].
Ciresan, Dan Claudiu ;
Meier, Ueli ;
Gambardella, Luca Maria ;
Schmidhuber, Juergen .
NEURAL COMPUTATION, 2010, 22 (12) :3207-3220
[10]  
Cortez P, 2008, 15TH EUROPEAN CONCURRENT ENGINEERING CONFERENCE/5TH FUTURE BUSINESS TECHNOLOGY CONFERENCE, P5