A feature dependent Naive Bayes approach and its application to the software defect prediction problem

被引:89
|
作者
Arar, Omer Faruk [1 ]
Ayan, Kursat [1 ]
机构
[1] Sakarya Univ, Fac Comp & Informat Sci, Dept Comp Engn, Sakarya, Turkey
关键词
Naive Bayes; Feature independence; Software defect prediction; Discretization; Data mining; OBJECT-ORIENTED DESIGN; TOP; 10; ALGORITHMS; FAULT PREDICTION; METRICS; CLASSIFICATION; DISCRETIZATION; VALIDATION; ATTRIBUTES;
D O I
10.1016/j.asoc.2017.05.043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Naive Bayes is one of the most widely used algorithms in classification problems because of its simplicity, effectiveness, and robustness. It is suitable for many learning scenarios, such as image classification, fraud detection, web mining, and text classification. Naive Bayes is a probabilistic approach based on assumptions that features are independent of each other and that their weights are equally important. However, in practice, features may be interrelated. In that case, such assumptions may cause a dramatic decrease in performance. In this study, by following preprocessing steps, a Feature Dependent Naive Bayes (FDNB) classification method is proposed. Features are included for calculation as pairs to create dependence between one another. This method was applied to the software defect prediction problem and experiments were carried out using widely recognized NASA PROMISE data sets. The obtained results show that this new method is more successful than the standard Naive Bayes approach and that it has a competitive performance with other feature-weighting techniques. A further aim of this study is to demonstrate that to be reliable, a learning model must be constructed by using only training data, as otherwise misleading results arise from the use of the entire data set. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:197 / 209
页数:13
相关论文
共 50 条
  • [1] Software Defect Prediction with Naive Bayes Classifier
    Rahim, Aqsa
    Hayat, Zara
    Abbas, Muhammad
    Rahim, Amna
    Rahim, Muhammad Abdul
    PROCEEDINGS OF 2021 INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGIES (IBCAST), 2021, : 293 - 297
  • [2] Software Defect Prediction Using Principal Component Analysis and Naive Bayes Algorithm
    Dhamayanthi, N.
    Lavanya, B.
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA ENGINEERING (ICCIDE 2018), 2019, 28 : 241 - 248
  • [3] Software defect prediction: Heuristics for weighted Naive Bayes
    Turhan, Burak
    Bener, Ayse
    ICSOFT 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SOFTWARE AND DATA TECHNOLOGIES, VOL SE: SOFTWARE ENGINEERING, 2007, : 244 - 249
  • [4] An improved software defect prediction model based on grey incidence analysis and Naive Bayes algorithm
    Zhang, Shu
    Wang, Yuhong
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (05) : 6047 - 6060
  • [5] A new weighted naive Bayes method based on information diffusion for software defect prediction
    Haijin Ji
    Song Huang
    Yaning Wu
    Zhanwei Hui
    Changyou Zheng
    Software Quality Journal, 2019, 27 : 923 - 968
  • [6] A new weighted naive Bayes method based on information diffusion for software defect prediction
    Ji, Haijin
    Huang, Song
    Wu, Yaning
    Hui, Zhanwei
    Zheng, Changyou
    SOFTWARE QUALITY JOURNAL, 2019, 27 (03) : 923 - 968
  • [7] Empirical Studies of a Kernel Density Estimation Based Naive Bayes Method for Software Defect Prediction
    Ji, Haijin
    Huang, Song
    Lv, Xuewei
    Wu, Yaning
    Feng, Yuntian
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (01) : 75 - 84
  • [8] Deep feature weighting for naive Bayes and its application to text classification
    Jiang, Liangxiao
    Li, Chaoqun
    Wang, Shasha
    Zhang, Lungan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 52 : 26 - 39
  • [9] A feature selection approach based on a similarity measure for software defect prediction
    Yu, Qiao
    Jiang, Shu-juan
    Wang, Rong-cun
    Wang, Hong-yang
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2017, 18 (11) : 1744 - 1753
  • [10] Cross-Project Defect Prediction For Web Application Using Naive Bayes (Case Study: Petstore Web Application)
    Habibi, Puja Ahmad
    Amrizal, Victor
    Bahaweres, Rizal Broer
    2018 INTERNATIONAL WORKSHOP ON BIG DATA AND INFORMATION SECURITY (IWBIS), 2018, : 13 - 18