A feature dependent Naive Bayes approach and its application to the software defect prediction problem

被引:89
作者
Arar, Omer Faruk [1 ]
Ayan, Kursat [1 ]
机构
[1] Sakarya Univ, Fac Comp & Informat Sci, Dept Comp Engn, Sakarya, Turkey
关键词
Naive Bayes; Feature independence; Software defect prediction; Discretization; Data mining; OBJECT-ORIENTED DESIGN; TOP; 10; ALGORITHMS; FAULT PREDICTION; METRICS; CLASSIFICATION; DISCRETIZATION; VALIDATION; ATTRIBUTES;
D O I
10.1016/j.asoc.2017.05.043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Naive Bayes is one of the most widely used algorithms in classification problems because of its simplicity, effectiveness, and robustness. It is suitable for many learning scenarios, such as image classification, fraud detection, web mining, and text classification. Naive Bayes is a probabilistic approach based on assumptions that features are independent of each other and that their weights are equally important. However, in practice, features may be interrelated. In that case, such assumptions may cause a dramatic decrease in performance. In this study, by following preprocessing steps, a Feature Dependent Naive Bayes (FDNB) classification method is proposed. Features are included for calculation as pairs to create dependence between one another. This method was applied to the software defect prediction problem and experiments were carried out using widely recognized NASA PROMISE data sets. The obtained results show that this new method is more successful than the standard Naive Bayes approach and that it has a competitive performance with other feature-weighting techniques. A further aim of this study is to demonstrate that to be reliable, a learning model must be constructed by using only training data, as otherwise misleading results arise from the use of the entire data set. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:197 / 209
页数:13
相关论文
共 50 条
  • [21] A New Feature Selection Approach to Naive Bayes Text Classifiers
    Zhang, Lungan
    Jiang, Liangxiao
    Li, Chaoqun
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2016, 30 (02)
  • [22] Analysis of Feature Selection Methods in Software Defect Prediction Models
    Ali, Misbah
    Mazhar, Tehseen
    Shahzad, Tariq
    Ghadi, Yazeed Yasin
    Mohsin, Syed Muhammad
    Akber, Syed Muhammad Abrar
    Ali, Mohammed
    IEEE ACCESS, 2023, 11 : 145954 - 145974
  • [23] Weighted software metrics aggregation and its application to defect prediction
    Ulan, Maria
    Lowe, Welf
    Ericsson, Morgan
    Wingkvist, Anna
    EMPIRICAL SOFTWARE ENGINEERING, 2021, 26 (05)
  • [24] Genetic Feature Selection for Software Defect Prediction
    Wahono, Romi Satria
    Herman, Nanna Suryana
    ADVANCED SCIENCE LETTERS, 2014, 20 (01) : 239 - 244
  • [25] LIMCR: Less-Informative Majorities Cleaning Rule Based on Naive Bayes for Imbalance Learning in Software Defect Prediction
    Wu, Yumei
    Yao, Jingxiu
    Chang, Shuo
    Liu, Bin
    APPLIED SCIENCES-BASEL, 2020, 10 (23): : 1 - 24
  • [26] Capsule feature selector for software defect prediction
    Tang, Yu
    Dai, Qi
    Du, Ye
    Zheng, Tian-shuai
    Li, Mei-hong
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (03)
  • [27] Diversity based multi-cluster over sampling approach to alleviate the class imbalance problem in software defect prediction
    Arun, C.
    Lakshmi, C.
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2023,
  • [28] Deep neural network based hybrid approach for software defect prediction using software metrics
    Manjula, C.
    Florence, Lilly
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 4): : S9847 - S9863
  • [29] A sequential feature extraction approach for naive bayes classification of microarray data
    Fan, Liwei
    Poh, Kim-Leng
    Zhou, Peng
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (06) : 9919 - 9923
  • [30] A random approximate reduct-based ensemble learning approach and its application in software defect prediction
    Jiang, Feng
    Yu, Xu
    Gong, Dunwei
    Du, Junwei
    INFORMATION SCIENCES, 2022, 609 : 1147 - 1168