Exploring Software Quality Classification with a Wrapper-Based Feature Ranking Technique

被引:0
|
作者
Gao, Kehan [1 ]
Khoshgoftaar, Taghi [2 ]
Napolitano, Amri [2 ]
机构
[1] Eastern Connecticut State Univ, Willimantic, CT 06226 USA
[2] Florida Atlantic Univ, Boca Raton, FL 33431 USA
来源
ICTAI: 2009 21ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE | 2009年
关键词
performance metric; feature ranking technique; software quality modeling;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is a process of selecting a subset of relevant features for building learning models. It is an important activity for data preprocessing used in software quality modeling and other data mining problems. Feature selection algorithms can be divided into two categories, feature ranking and feature subset selection. Feature ranking orders the features by a criterion and a user selects some of the features that are appropriate for a given scenario. Feature subset selection techniques search the space of possible feature subsets and evaluate the suitability of each. This paper investigates performance metric based feature ranking techniques by using the multilayer perceptron (MLP) learner with nine different performance metrics. The nine performance metrics include Overall Accuracy (OA), Default F-Measure (DFM), Default Geometric Mean (DGM), Default Arithmetic Mean (DAM), Area Under ROC (AUC), Area Under PRC (PRC), Best F-Measure (BFM), Best Geometric Mean (BGM) and Best Arithmetic Mean (BAM). The goal of the paper is to study the effect of the different performance metrics on the feature ranking results, which in turn influences the classification performance. We assessed the performance of the classification models constructed on those selected feature subsets through an empirical case study that was carried out on six data sets of real-world software systems. The results demonstrate that AUC, PRC, BFM, BGM and BAM as performance metrics for feature ranking outperformed the other performance metrics, OA, DFM, DGM and DAM, unanimously across all the data sets and therefore are recommended based on this study. In addition, the performances of the classification models were maintained or even improved when over 85 percent of the,features were eliminated from the original data sets.
引用
收藏
页码:67 / +
页数:2
相关论文
共 50 条
  • [1] Wrapper-based Feature Ranking for Software Engineering Metrics
    Altidor, Wilker
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    EIGHTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2009, : 241 - 246
  • [2] An Empirical Study on Wrapper-based Feature Ranking
    Altidor, Wilker
    Khoshgoftaar, Taghi M.
    Van Hulse, Jason
    ICTAI: 2009 21ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, 2009, : 75 - 82
  • [3] A COMPARATIVE STUDY OF FILTER-BASED AND WRAPPER-BASED FEATURE RANKING TECHNIQUES FOR SOFTWARE QUALITY MODELING
    Khoshgoftaar, Taghi M.
    Gao, Kehan
    Bullard, Lofton A.
    INTERNATIONAL JOURNAL OF RELIABILITY QUALITY AND SAFETY ENGINEERING, 2011, 18 (04) : 341 - 364
  • [4] WRAPPER-BASED FEATURE RANKING TECHNIQUES FOR DETERMINING RELEVANCE OF SOFTWARE ENGINEERING METRICS
    Altidor, Wilker
    Khoshgoftaar, Taghi M.
    Gao, Kehan
    INTERNATIONAL JOURNAL OF RELIABILITY QUALITY AND SAFETY ENGINEERING, 2010, 17 (05) : 425 - 464
  • [5] An Empirical Investigation on Wrapper-Based Feature Selection for Predicting Software Quality
    Wang, Huanjing
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2015, 25 (01) : 93 - 114
  • [6] Filter-Wrapper based Feature Ranking Technique for Dynamic Software Quality Attributes
    Kamaruddin, Siti Sakira
    Yahaya, Jamaiah
    Deraman, Aziz
    Ahmad, Ruzita
    PROCEEDINGS OF KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE (KMICE) 2012, 2012, : 604 - 608
  • [7] A Novel Wrapper-Based Optimization Algorithm for the Feature Selection and Classification
    Talpur, Noureen
    Abdulkadir, Said Jadid
    Hasan, Mohd Hilmi
    Alhussian, Hitham
    Alwadain, Ayed
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (03): : 5799 - 5820
  • [8] An Empirical Study on Wrapper-based Feature Selection for Software Engineering Data
    Wang, Huanjing
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 2, 2013, : 84 - 89
  • [9] Improving Incremental Wrapper-Based Feature Subset Selection by Using Re-ranking
    Bermejo, Pablo
    Gamez, Jose A.
    Puerta, Jose M.
    TRENDS IN APPLIED INTELLIGENT SYSTEMS, PT I, PROCEEDINGS, 2010, 6096 : 580 - 589
  • [10] Improving performance for classification with incomplete data using wrapper-based feature selection
    Tran C.T.
    Zhang M.
    Andreae P.
    Xue B.
    Evolutionary Intelligence, 2016, 9 (3) : 81 - 94