Instance selection using one-versus-all and one-versus-one decomposition approaches in multiclass classification datasets

被引:3
作者
Fang, Ching-Lin [1 ]
Wang, Ming-Chang [1 ]
Tsai, Chih-Fong [2 ]
Lin, Wei-Chao [3 ,4 ]
Liao, Pei-Qi [2 ]
机构
[1] Natl Chung Cheng Univ, Dept Business Adm, Chiayi, Taiwan
[2] Natl Cent Univ, Dept Informat Management, Taoyuan, Taiwan
[3] Chang Gung Univ, Dept Informat Management, Taoyuan, Taiwan
[4] Chang Gung Mem Hosp Linkou, Dept Thorac Surg, Taoyuan, Taiwan
关键词
data mining; instance selection; machine learning; multiclass classification; one-versus-all; one-versus-one; REDUCTION; BINARY;
D O I
10.1111/exsy.13217
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Instance is important in data analysis and mining; it filters out unrepresentative, redundant, or noisy data from a given training set to obtain effective model learning. Various instance selection algorithms are proposed in the literature, and their potential and applicability in data cleaning and preprocessing steps are demonstrated. For multiclass classification datasets, the existing instance selection algorithms must deal with all the instances across the different classes simultaneously to produce a reduced training set. Generally, every multiclass classification dataset can be regarded as a complex domain problem, which can be effectively solved using the divide-and-conquer principle. In this study, the one-versus-all (OVA) and one-versus-one (OVO) decomposition approaches were used to decompose a multiclass dataset into multiple binary class datasets. These approaches have been widely employed when constructing the classifier but have never been considered in instance selection. The results of instance selection performance obtained with the OVA, OVO, and baseline approaches were assessed and compared for 20 different domain multiclass datasets as the first study and five medical domain datasets as the validation study. Furthermore, three instance selection algorithms were compared, including IB3, DROP3, and GA. The results demonstrate that using the OVO approach to perform instance selection can make the support vector machine (SVM) and k-nearest neighbour (k-NN) classifiers perform significantly better than the OVA and baseline approaches in terms of the area under the ROC curve (AUC) rate, regardless of the instance selection algorithm used. Moreover, the OVO approach can provide reasonably good data reduction rates and processing times, which are all better than those of the OVA approach.
引用
收藏
页数:13
相关论文
共 47 条
  • [1] INSTANCE-BASED LEARNING ALGORITHMS
    AHA, DW
    KIBLER, D
    ALBERT, MK
    [J]. MACHINE LEARNING, 1991, 6 (01) : 37 - 66
  • [2] Alasadi S. A., 2017, Journal of Engineering and Applied Sciences, V12, P4102, DOI DOI 10.3923/JEASCI.2017.4102.4107
  • [3] Allen M., 2015, Multi-Domain master data management: Advanced MDM and data governance in practice
  • [4] Reducing multiclass to binary: A unifying approach for margin classifiers
    Allwein, EL
    Schapire, RE
    Singer, Y
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) : 113 - 141
  • [5] Data reduction based on NN-kNN measure for NN classification and regression
    An, Shuang
    Hu, Qinghua
    Wang, Changzhong
    Guo, Ge
    Li, Piyu
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (03) : 765 - 781
  • [6] A review of instance selection methods
    Arturo Olvera-Lopez, J.
    Ariel Carrasco-Ochoa, J.
    Francisco Martinez-Trinidad, J.
    Kittler, Josef
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2010, 34 (02) : 133 - 143
  • [7] Efficient and decision boundary aware instance selection for support vector machines
    Aslani, Mohammad
    Seipel, Stefan
    [J]. INFORMATION SCIENCES, 2021, 577 : 579 - 598
  • [8] Bishop C. M., 2007, Pattern Recognition and Machine Learning Information Science and Statistics, V1st
  • [10] Using evolutionary algorithms as instance selection for data reduction in KDD: An experimental study
    Cano, JR
    Herrera, F
    Lozano, M
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2003, 7 (06) : 561 - 575