SAGA: A Hybrid Technique to handle Imbalance Data in Software Defect Prediction

被引:1
|
作者
Malhotra, Ruchika [1 ]
Kapoor, Ritvik [1 ]
Saxena, Paridhi [1 ]
Sharma, Parth [1 ]
机构
[1] Delhi Technol Univ, Dept Comp Sci & Engn, Delhi, India
来源
11TH IEEE SYMPOSIUM ON COMPUTER APPLICATIONS & INDUSTRIAL ELECTRONICS (ISCAIE 2021) | 2021年
关键词
software defect prediction; data imbalance; ensemble; feature space partitioning; Genetic Algorithm; Synthetic Minority Oversampling; FEATURE-SELECTION; SMOTE;
D O I
10.1109/ISCAIE51753.2021.9431842
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software defect prediction has been a concurrent topic in software quality-based research. Predictive models that identify defect prone parts of Software can be evolved from defect data and software metrics. Various studies conducted in the past have explored Machine Learning-based approaches for this purpose but the problem of handling imbalanced defect data without compromising on the model's performance remains at large. In this work, we have proposed, compared, and analyzed a hybrid technique, SAGA(SMOTE + AdaSS + Genetic Algorithm), for solving the imbalance problem faced in software defect prediction. SAGA employs ensemble classification based on feature space partitioning in conjunction with the Synthetic Minority Oversampling technique. Various parameters related to feature space partitioning are optimized using the Genetic Algorithm The values of ROC-AUC, G-mean, Balance, and Accuracy obtained on open-source datasets confirm the effectiveness of the proposed technique.
引用
收藏
页码:331 / 336
页数:6
相关论文
共 50 条
  • [21] The Impact Study of Class Imbalance on the Performance of Software Defect Prediction Models
    Yu Q.
    Jiang S.-J.
    Zhang Y.-M.
    Wang X.-Y.
    Gao P.-F.
    Qian J.-Y.
    Qian, Jun-Yan (qjy2000@gmail.com), 2018, Science Press (41): : 809 - 824
  • [22] Software Defect Prediction based on Conditional Random Field in Imbalance Distribution
    Yang, Chunhui
    Gao, Yan
    Xiang, Jianwen
    Liang, Lixin
    2015 2ND INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING AND INTERNET OF THINGS (DCIT), 2015, : 67 - 71
  • [23] Software Defect Prediction based on Adaboost algorithm under Imbalance Distribution
    Gao, Yan
    Yang, Chunhui
    PROCEEDINGS OF THE 2016 4TH INTERNATIONAL CONFERENCE ON SENSORS, MECHATRONICS AND AUTOMATION (ICSMA 2016), 2016, 136 : 739 - 746
  • [24] Tool to Handle Imbalancing Problem in Software Defect Prediction Using Oversampling Methods
    Malhotra, Ruchika
    Kamal, Shine
    2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 906 - 912
  • [25] On the relative value of data resampling approaches for software defect prediction
    Bennin, Kwabena Ebo
    Keung, Jacky W.
    Monden, Akito
    EMPIRICAL SOFTWARE ENGINEERING, 2019, 24 (02) : 602 - 636
  • [26] Improving the undersampling technique by optimizing the termination condition for software defect prediction
    Feng, Shuo
    Keung, Jacky
    Xiao, Yan
    Zhang, Peichang
    Yu, Xiao
    Cao, Xiaochun
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235
  • [27] Combining Integreted Sampling Technique with Feature Selection for Software Defect Prediction
    Putri, Sukmawati Anggraeni
    Frieyadie
    2017 5TH INTERNATIONAL CONFERENCE ON CYBER AND IT SERVICE MANAGEMENT (CITSM 2017), 2017, : 235 - 240
  • [28] Software defect prediction using hybrid techniques: a systematic literature review
    Malhotra, Ruchika
    Chawla, Sonali
    Sharma, Anjali
    SOFT COMPUTING, 2023, 27 (12) : 8255 - 8288
  • [29] Deep neural network based hybrid approach for software defect prediction using software metrics
    Manjula, C.
    Florence, Lilly
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 4): : S9847 - S9863
  • [30] Genetic Feature Selection for Software Defect Prediction
    Wahono, Romi Satria
    Herman, Nanna Suryana
    ADVANCED SCIENCE LETTERS, 2014, 20 (01) : 239 - 244