IMNRFixer: A hybrid approach to alleviate class-imbalance problem for predicting the fixability of Non-Reproducible bugs

Cited by: 2
Authors
Goyal, Anjali [1]
Sardana, Neetu [2]
Affiliations
[1] Amity Univ, Amity Sch Engn & Technol, Noida, Uttar Pradesh, India
[2] Jaypee Inst Informat Technol, Dept Comp Sci & Engn & Informat Technol, Noida, India
Keywords
bug fixing; bug report; class-imbalance; classification; ensemble techniques; machine learning; mining software repositories; Non-Reproducible bugs; prediction tool; sampling; DECISION-MAKING; ENSEMBLE; CLASSIFICATION; TIME; FAILURES;
DOI
10.1002/smr.2290
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification codes
081202 ; 0835 ;
Abstract
Software maintenance is an important phase in the software development life cycle. Software projects maintain bug repositories to gather, organize, and keep track of bug reports, which are resolved by numerous software developers. When the assigned developer cannot resolve a reported bug, they mark the resolution of the bug report as Non-Reproducible (NR). When NR bugs are reconsidered, a few of them get resolved, and their resolution changes from NR to fix (NRF). The main aim of this paper is to predict these fixable NRF bug reports. A major challenge in predicting NRF bugs from NR bugs is that only a small portion of NR bugs get fixed, i.e., the class-imbalance problem. For example, NRF bugs account for only 8.64%, 4.73%, 4.56%, and 1.06% of NR bugs in the NetBeans, Eclipse, Open Office, and Mozilla Firefox projects, respectively. In this paper, we work on improving the classification performance on these imbalanced datasets. We propose IMNRFixer, a novel hybrid NRF prediction tool. IMNRFixer uses three different techniques to combat the class-imbalance problem: undersampling, oversampling, and ensemble models. We evaluate the performance of the IMNRFixer models on four large open-source projects from Bugzilla repositories. Our results show that IMNRFixer outperforms conventional machine learning techniques. IMNRFixer achieves performance of up to 71.7%, 93.1%, 91.7%, and 96.5% when predicting the minority class (NRF) for the NetBeans, Eclipse, Open Office, and Mozilla Firefox projects, respectively.
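To make the two sampling ideas named in the abstract concrete, the following is a minimal sketch of random undersampling and random oversampling on a binary-labelled dataset. It is not the IMNRFixer implementation: the function names, the toy data (1,000 hypothetical NR reports, 47 of them NRF, loosely mirroring the roughly 4.7% rate quoted for Eclipse), and the choice of purely random sampling are assumptions made for illustration, and the abstract does not say which sampling variants the tool uses.

```python
import random
from collections import Counter


def random_undersample(rows, labels, minority, seed=0):
    """Drop majority-class rows at random until both classes are the same size.

    Assumes a binary problem in which `minority` is the smaller class.
    """
    rng = random.Random(seed)
    minority_idx = [i for i, y in enumerate(labels) if y == minority]
    majority_idx = [i for i, y in enumerate(labels) if y != minority]
    kept = rng.sample(majority_idx, len(minority_idx))
    idx = sorted(minority_idx + kept)
    return [rows[i] for i in idx], [labels[i] for i in idx]


def random_oversample(rows, labels, minority, seed=0):
    """Duplicate minority-class rows at random until both classes are the same size.

    Assumes a binary problem in which `minority` is the smaller class.
    """
    rng = random.Random(seed)
    minority_idx = [i for i, y in enumerate(labels) if y == minority]
    majority_idx = [i for i, y in enumerate(labels) if y != minority]
    shortfall = len(majority_idx) - len(minority_idx)
    extra = [rng.choice(minority_idx) for _ in range(shortfall)]
    idx = list(range(len(labels))) + extra
    return [rows[i] for i in idx], [labels[i] for i in idx]


# Hypothetical toy data: row ids stand in for bug-report feature vectors.
rows = list(range(1000))
labels = ["NRF"] * 47 + ["NR"] * 953

_, y_under = random_undersample(rows, labels, minority="NRF")
_, y_over = random_oversample(rows, labels, minority="NRF")
print(Counter(labels), Counter(y_under), Counter(y_over))
```

Undersampling discards information from the majority (NR) class, while oversampling repeats minority (NRF) reports and can encourage overfitting; an ensemble approach would typically train several classifiers on differently resampled subsets and combine their votes.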
Pages: 29