OFES: Optimal feature evaluation and selection for multi-class classification

被引:9
作者
Ram, Vallam Sudhakar Sai [1 ]
Kayastha, Namrata [1 ]
Sha, Kewei [1 ]
机构
[1] Univ Houston Clear Lake, Dept Comp Sci, Houston, TX 77058 USA
关键词
Feature evaluation; Feature selection; Classification;
D O I
10.1016/j.datak.2022.102007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The complexity and accuracy of classification algorithms largely depend on the size and the quality of the feature set used to build classifiers. Feature evaluation and selection are critical steps to decide a small set of high-quality features to build accurate and efficient classifiers since low-quality features not only have negative impacts on classification results but also increase the complexity of classification algorithms. Current popular feature selection algorithms are not sufficient in selecting a set of high-quality features and discarding low-quality features, especially for streaming data. This paper proposes a novel and efficient approach, optimal feature evaluation and selection (OFES), to evaluate and select high-quality features for multi class classification. OFES first measures the difference between any two classes based on the feature that is to be evaluated. Then, it defines two quantitative measures to evaluate quality of the feature and identify high-quality features. Applying OFES in a multi-class classification application that identifies users based on their arm movement patterns, we find when compared with other popular feature evaluation and selection approaches, such as Information Gain Feature Ranking and Random Projections with Matlab feature ranking, OFES identifies a set of high-quality features that improves the accuracy of classification regardless of different classification algorithms. It also demonstrates great scalability with the increase of number of classes and yields a higher accuracy of 95%.
引用
收藏
页数:16
相关论文
共 49 条
  • [1] Al Kork SK, 2017, 2017 2ND INTERNATIONAL CONFERENCE ON BIO-ENGINEERING FOR SMART TECHNOLOGIES (BIOSMART)
  • [2] [Anonymous], 2016, P 25 INT JOINT C ART
  • [3] [Anonymous], 2017, 8th international conference of pattern recognition systems (ICPRS 2017), DOI DOI 10.1049/CP.2017.0131
  • [4] Arusada Muhammad Diaphan Nizam, 2017, 2017 5th International Conference on Information and Communication Technology (ICoICT), DOI 10.1109/ICoICT.2017.8074652
  • [5] Bai F., 2018, INT C DIG SIGN PROC, P1, DOI 10.1109/ICDSP.2018.8631672
  • [6] A normalized root-mean-square distance for comparing protein three-dimensional structures
    Carugo, O
    Pongor, S
    [J]. PROTEIN SCIENCE, 2001, 10 (07) : 1470 - 1473
  • [7] Smartphone User Identity Verification Using Gait Characteristics
    Damasevicius, Robertas
    Maskeliunas, Rytis
    Venckauskas, Algimantas
    Wozniak, Marcin
    [J]. SYMMETRY-BASEL, 2016, 8 (10):
  • [8] Human Activity Recognition in AAL Environments Using Random Projections
    Damasevicius, Robertas
    Vasiljevas, Mindaugas
    Salkevicius, Justas
    Wozniak, Marcin
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2016, 2016
  • [9] Derawi M. O., 2010, Proceedings of the 2010 Sixth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIHMSP 2010), P306, DOI 10.1109/IIHMSP.2010.83
  • [10] Hand Dynamics for Behavioral User Authentication
    Garcia, Fuensanta Torres
    Krombholz, Katharina
    Mayer, Rudolf
    Weippl, Edgar
    [J]. PROCEEDINGS OF 2016 11TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY, (ARES 2016), 2016, : 389 - 398