A feature dependent Naive Bayes approach and its application to the software defect prediction problem

Cited by: 89
Authors
Arar, Omer Faruk [1]
Ayan, Kursat [1]
Affiliations
[1] Sakarya Univ, Fac Comp & Informat Sci, Dept Comp Engn, Sakarya, Turkey
Keywords
Naive Bayes; Feature independence; Software defect prediction; Discretization; Data mining; OBJECT-ORIENTED DESIGN; TOP 10 ALGORITHMS; FAULT PREDICTION; METRICS; CLASSIFICATION; DISCRETIZATION; VALIDATION; ATTRIBUTES
DOI
10.1016/j.asoc.2017.05.043
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Naive Bayes is one of the most widely used algorithms in classification problems because of its simplicity, effectiveness, and robustness. It is suitable for many learning scenarios, such as image classification, fraud detection, web mining, and text classification. Naive Bayes is a probabilistic approach based on assumptions that features are independent of each other and that their weights are equally important. However, in practice, features may be interrelated. In that case, such assumptions may cause a dramatic decrease in performance. In this study, by following preprocessing steps, a Feature Dependent Naive Bayes (FDNB) classification method is proposed. Features are included for calculation as pairs to create dependence between one another. This method was applied to the software defect prediction problem and experiments were carried out using widely recognized NASA PROMISE data sets. The obtained results show that this new method is more successful than the standard Naive Bayes approach and that it has a competitive performance with other feature-weighting techniques. A further aim of this study is to demonstrate that to be reliable, a learning model must be constructed by using only training data, as otherwise misleading results arise from the use of the entire data set. (C) 2017 Elsevier B.V. All rights reserved.
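The abstract's closing point, that preprocessing must be learned from training data only, can be illustrated with a minimal sketch. It is not the authors' procedure: equal-width binning, the helper names `fit_equal_width_bins` and `discretize`, and the metric values are all assumptions made for illustration.

```python
from bisect import bisect_right


def fit_equal_width_bins(values, n_bins):
    """Learn equal-width cut points from the given values only."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    return [lo + width * i for i in range(1, n_bins)]


def discretize(value, cut_points):
    """Map a value to a bin index; out-of-range values land in the end bins."""
    return bisect_right(cut_points, value)


# Hypothetical software metric values (not taken from the paper).
train = [10, 12, 15, 20, 22, 30]
test = [11, 95]

# Cut points learned from the training split alone.
cuts_train_only = fit_equal_width_bins(train, n_bins=2)

# Cut points learned from the entire data set: the test outlier shifts them.
cuts_leaky = fit_equal_width_bins(train + test, n_bins=2)

train_bins_ok = [discretize(v, cuts_train_only) for v in train]
train_bins_leaky = [discretize(v, cuts_leaky) for v in train]

print(cuts_train_only, train_bins_ok)
print(cuts_leaky, train_bins_leaky)
```

In this toy case the cut point learned from the whole data set is pulled upward by a test value, so every training sample collapses into a single bin. The model would then be trained on a representation that already depends on the test data, which is the kind of leakage the abstract warns about.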
Pages: 197-209
Number of pages: 13
Related papers
50 in total
  • [31] Mobile Application Software Defect Prediction
    Ricky, Michael Yoseph
    Yulianto, Budi
    Purnomo, Fredy
    PROCEEDINGS 2016 IEEE SYMPOSIUM ON SERVICE-ORIENTED SYSTEM ENGINEERING SOSE 2016, 2016, : 307 - 313
  • [32] A cognitive and neural network approach for software defect prediction
    Rajnish, Kumar
    Bhattacharjee, Vandana
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (05) : 6477 - 6503
  • [33] A many objective based feature selection model for software defect prediction
    Mao, Qi
    Zhang, Jingbo
    Zhao, Tianhao
    Cai, Xingjuan
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (19)
  • [34] Performance Analysis of Feature Selection Methods in Software Defect Prediction: A Search Method Approach
    Balogun, Abdullateef Oluwagbemiga
    Basri, Shuib
    Abdulkadir, Said Jadid
    Hashim, Ahmad Sobri
    APPLIED SCIENCES-BASEL, 2019, 9 (13):
  • [35] Software Defect Prediction Scheme Based on Feature Selection
    Wang, Pei
    Jin, Cong
    Jin, Shu-Wei
    2012 INTERNATIONAL SYMPOSIUM ON INFORMATION SCIENCE AND ENGINEERING (ISISE), 2012, : 477 - 480
  • [36] Deep Semantic Feature Learning for Software Defect Prediction
    Wang, Song
    Liu, Taiyue
    Nam, Jaechang
    Tan, Lin
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2020, 46 (12) : 1267 - 1293
  • [37] Feature Selection in Software Defect Prediction: A Comparative Study
    Kakkar, Misha
    Jain, Sarika
    2016 6TH INTERNATIONAL CONFERENCE - CLOUD SYSTEM AND BIG DATA ENGINEERING (CONFLUENCE), 2016, : 658 - 663
  • [38] FECAR: A Feature Selection Framework for Software Defect Prediction
    Liu, Shulong
    Chen, Xiang
    Liu, Wangshu
    Chen, Jiaqiang
    Gu, Qing
    Chen, Daoxu
    2014 IEEE 38TH ANNUAL INTERNATIONAL COMPUTERS, SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2014, : 426 - 435
  • [39] DISCRIMINATIVELY WEIGHTED NAIVE BAYES AND ITS APPLICATION IN TEXT CLASSIFICATION
    Jiang, Liangxiao
    Wang, Dianghong
    Cai, Zhihua
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2012, 21 (01)
  • [40] Prediction of Carbon Stock Available in Forest using Naive Bayes Approach
    Walia, Navjot Kaur
    Kalra, Parul
    Mehrotra, Deepti
    2016 SECOND INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2016, : 275 - 279