SAGA: A Hybrid Technique to handle Imbalance Data in Software Defect Prediction

Cited by: 1
Authors
Malhotra, Ruchika [1 ]
Kapoor, Ritvik [1 ]
Saxena, Paridhi [1 ]
Sharma, Parth [1 ]
Affiliation
[1] Delhi Technol Univ, Dept Comp Sci & Engn, Delhi, India
Source
11TH IEEE SYMPOSIUM ON COMPUTER APPLICATIONS & INDUSTRIAL ELECTRONICS (ISCAIE 2021) | 2021
Keywords
software defect prediction; data imbalance; ensemble; feature space partitioning; Genetic Algorithm; Synthetic Minority Oversampling; FEATURE-SELECTION; SMOTE;
DOI
10.1109/ISCAIE51753.2021.9431842
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Software defect prediction has been a long-standing topic in software quality research. Predictive models that identify defect-prone parts of software can be derived from defect data and software metrics. Various past studies have explored machine learning-based approaches for this purpose, but the problem of handling imbalanced defect data without compromising the model's performance remains open. In this work, we propose, compare, and analyze a hybrid technique, SAGA (SMOTE + AdaSS + Genetic Algorithm), for solving the imbalance problem faced in software defect prediction. SAGA employs ensemble classification based on feature space partitioning in conjunction with the Synthetic Minority Oversampling Technique. Various parameters related to feature space partitioning are optimized using the Genetic Algorithm. The values of ROC-AUC, G-mean, Balance, and Accuracy obtained on open-source datasets confirm the effectiveness of the proposed technique.
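The abstract names G-mean and Balance alongside Accuracy but this record does not define them. The sketch below uses the definitions commonly found in the defect prediction literature (G-mean as the geometric mean of recall on the two classes, Balance as one minus the normalized distance from the ideal point pf = 0, pd = 1); these definitions are an assumption here, and the paper may compute them differently. The function names and the toy labels are hypothetical and are not data from the paper.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count TP, FP, TN, FN for binary labels where 1 marks a defective module."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, fp, tn, fn


def imbalance_metrics(y_true, y_pred):
    """Return accuracy, G-mean and Balance under the assumed definitions."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    # pd: probability of detection (recall on the defective class).
    pd = tp / (tp + fn) if (tp + fn) else 0.0
    # pf: probability of false alarm (false positive rate).
    pf = fp / (fp + tn) if (fp + tn) else 0.0
    accuracy = (tp + tn) / len(y_true)
    # G-mean: geometric mean of recall on defective and non-defective classes.
    g_mean = math.sqrt(pd * (1.0 - pf))
    # Balance: 1 minus the distance to (pf=0, pd=1), scaled by sqrt(2).
    balance = 1.0 - math.sqrt(pf ** 2 + (1.0 - pd) ** 2) / math.sqrt(2.0)
    return {"accuracy": accuracy, "g_mean": g_mean, "balance": balance}


# Toy, made-up labels: 2 defective modules out of 10.
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
always_clean = [0] * 10                      # predicts "no defect" everywhere
some_model = [1, 0, 0, 0, 0, 0, 0, 0, 0, 1]  # finds one defect, one false alarm

trivial = imbalance_metrics(y_true, always_clean)
scored = imbalance_metrics(y_true, some_model)
print(trivial)
print(scored)
```

On the toy labels, the predictor that never flags a defect matches the other predictor on accuracy while its G-mean drops to zero, which is the usual argument for reporting imbalance-aware metrics next to Accuracy.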
Pages: 331-336
Number of pages: 6