Machine learning based methods for software fault prediction: A survey

被引:68
|
作者
Pandey, Sushant Kumar [1 ]
Mishra, Ravi Bhushan [1 ]
Tripathi, Anil Kumar [1 ]
机构
[1] Indian Inst Technol BHU, Dept Comp Sci & Engn, Varanasi, Uttar Pradesh, India
关键词
Machine learning; Fault proneness; Statistical techniques; Fault prediction; Systematic literature review; DEFECT PREDICTION; EMPIRICAL-ANALYSIS; FEATURE-SELECTION; MODEL; QUALITY; METRICS; CLASSIFICATION; PRONENESS; FRAMEWORK; REGRESSION;
D O I
10.1016/j.eswa.2021.114595
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Several prediction approaches are contained in the arena of software engineering such as prediction of effort, security, quality, fault, cost, and re-usability. All these prediction approaches are still in the rudimentary phase. Experiments and research are conducting to build a robust model. Software Fault Prediction (SFP) is the process to develop the model which can be utilized by software practitioners to detect faulty classes/module before the testing phase. Prediction of defective modules before the testing phase will help the software development team leader to allocate resources more optimally and it reduces the testing effort. In this article, we present a Systematic Literature Review (SLR) of various studies from 1990 to June 2019 towards applying machine learning and statistical method over software fault prediction. We have cited 208 research articles, in which we studied 154 relevant articles. We investigated the competence of machine learning in existing datasets and research projects. To the best of our knowledge, the existing SLR considered only a few parameters over SFP?s performance, and they partially examined the various threats and challenges of SFP techniques. In this article, we aggregated those parameters and analyzed them accordingly, and we also illustrate the different challenges in the SFP domain. We also compared the performance between machine learning and statistical techniques based on SFP models. Our empirical study and analysis demonstrate that the prediction ability of machine learning techniques for classifying class/module as fault/non-fault prone is better than classical statistical models. The performance of machine learning-based SFP methods over fault susceptibility is better than conventional statistical purposes. The empirical evidence of our survey reports that the machine learning techniques have the capability, which can be used to identify fault proneness, and able to form well-generalized result. We have also investigated a few challenges in fault prediction discipline, i.e., quality of data, over-fitting of models, and class imbalance problem. We have also summarized 154 articles in a tabular form for quick identification.
引用
收藏
页数:35
相关论文
共 50 条
  • [31] An effective fault prediction model developed using an extreme learning machine with various kernel methods
    Kumar, Lov
    Tirkey, Anand
    Rath, Santanu-Ku
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2018, 19 (07) : 864 - 888
  • [32] Comparison of Machine Learning Techniques for Software Quality Prediction
    Goyal, Somya
    INTERNATIONAL JOURNAL OF KNOWLEDGE AND SYSTEMS SCIENCE, 2020, 11 (02) : 20 - 40
  • [33] Machine Learning Methods for Quality Prediction in Production
    Sankhye, Sidharth
    Hu, Guiping
    LOGISTICS-BASEL, 2020, 4 (04):
  • [34] A Detailed Survey on Machine Intelligence Based Frameworks for Software Defect Prediction
    Singh, Raghvendra Omprkash
    Thankachan, Blessy
    2021 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, AND INTELLIGENT SYSTEMS (ICCCIS), 2021, : 360 - 365
  • [35] Machine learning methods for prediction of cancer driver genes: a survey paper
    Andrades, Renan
    Recamonde-Mendoza, Mariana
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (03)
  • [36] Machine Learning Methods for Septic Shock Prediction
    Darwiche, Aiman
    Mukherjee, Sumitra
    AIVR 2018: 2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND VIRTUAL REALITY, 2018, : 104 - 110
  • [37] Machine learning based success prediction for crowdsourcing software projects
    Illahi, Inam
    Liu, Hui
    Umer, Qasim
    Niu, Nan
    JOURNAL OF SYSTEMS AND SOFTWARE, 2021, 178
  • [38] An approach for fault prediction in SOA-based systems using machine learning techniques
    Bhandari, Guru Prasad
    Gupta, Ratneshwer
    Upadhyay, Satyanshu Kumar
    DATA TECHNOLOGIES AND APPLICATIONS, 2019, 53 (04) : 397 - 421
  • [39] On the use of deep learning in software defect prediction
    Giray, Gorkem
    Bennin, Kwabena Ebo
    Koksal, Omer
    Babur, Onder
    Tekinerdogan, Bedir
    JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 195
  • [40] A Combined-Learning Based Framework for Improved Software Fault Prediction
    Chubato Wondaferaw Yohannese
    Tianrui Li
    International Journal of Computational Intelligence Systems, 2017, 10 : 647 - 662