New feature selection and voting scheme to improve classification accuracy

被引:7
作者
Tsai, Cheng-Jung [1 ]
机构
[1] Natl Changhua Univ Educ, Grad Inst Stat & Informat Sci, Dept Math, Jin De Campus,1 Jin De Rd, Changhua 500, Taiwan
关键词
Data mining; Classification; Decision tree; Ensemble learning; Feature selection; Voting; ENSEMBLE; ALGORITHMS; CLASSIFIERS;
D O I
10.1007/s00500-019-03757-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification is a classic technique employed in data mining. Many ensemble learning methods have been introduced to improve the predictive accuracy of classification. A typical ensemble learning method consists of three steps: selection, building, and integration. Of the three steps, the first and third significantly affect the predictive accuracy of the classification. In this paper, we propose a new selection and integration scheme. Our method can improve the accuracy of subtrees and maintain their diversity. Through a new voting scheme, the predictive accuracy of ensemble learning is improved. We also theoretically analyzed the selection and integration steps of our method. The results of experimental analyses show that our method can achieve better accuracy than two state-of-the-art tree-based ensemble learning approaches.
引用
收藏
页码:12017 / 12030
页数:14
相关论文
共 50 条
  • [21] Feature selection algorithms to improve documents' classification performance
    Sousa, PAC
    Pimentao, JP
    Santos, BRD
    Moura-Pires, F
    [J]. ADVANCES IN WEB INTELLIGENCE, 2003, 2663 : 288 - 296
  • [22] Improve Abstract Data with Feature Selection for Classification Techniques
    Nuipian, Vatinee
    Meesad, Phayung
    Boonrawd, Pudsadee
    [J]. MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 : 3699 - +
  • [23] Feature selection inspired by human intelligence for improving classification accuracy of cancer types
    Shukla, Alok Kumar
    [J]. COMPUTATIONAL INTELLIGENCE, 2021, 37 (04) : 1571 - 1598
  • [24] Automatic Feature Selection by Regularization to Improve Bug Prediction Accuracy
    Osman, Haidar
    Ghafari, Mohammad
    Nierstrasz, Oscar
    [J]. 2017 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING TECHNIQUES FOR SOFTWARE QUALITY EVALUATION (MALTESQUE), 2017, : 27 - 32
  • [25] An automated feature selection and classification pipeline to improve explainability of clinical prediction models
    Moreno-Sanchez, Pedro A.
    [J]. 2021 IEEE 9TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2021), 2021, : 527 - 534
  • [26] Optimal and Novel Hybrid Feature Selection Framework for Effective Data Classification
    Venkataraman, Sivakumar
    Selvaraj, Rajalakshmi
    [J]. ADVANCES IN SYSTEMS, CONTROL AND AUTOMATION, 2018, 442 : 499 - 514
  • [27] A Hybrid Feature Selection Method for Classification Purposes
    Cateni, Silvia
    Colla, Valentina
    Vannucci, Marco
    [J]. UKSIM-AMSS EIGHTH EUROPEAN MODELLING SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS 2014), 2014, : 39 - 44
  • [28] Feature selection using dynamic weights for classification
    Sun, Xin
    Liu, Yanheng
    Xu, Mantao
    Chen, Huiling
    Han, Jiawei
    Wang, Kunhao
    [J]. KNOWLEDGE-BASED SYSTEMS, 2013, 37 : 541 - 549
  • [29] A unified pipeline for online feature selection and classification
    Bolon-Canedo, Veronica
    Fernandez-Francos, Diego
    Peteiro-Barral, Diego
    Alonso-Betanzos, Amparo
    Guijarro-Berdinas, Bertha
    Sanchez-Marono, Noelia
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2016, 55 : 532 - 545
  • [30] Sub-Feature Selection for Novel Classification
    Bhuyan, Hemanta Kumar
    Reddy, C. V. Madhusudan
    [J]. PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2018, : 477 - 482