SAGA: A Hybrid Technique to handle Imbalance Data in Software Defect Prediction

被引:1
|
作者
Malhotra, Ruchika [1 ]
Kapoor, Ritvik [1 ]
Saxena, Paridhi [1 ]
Sharma, Parth [1 ]
机构
[1] Delhi Technol Univ, Dept Comp Sci & Engn, Delhi, India
来源
11TH IEEE SYMPOSIUM ON COMPUTER APPLICATIONS & INDUSTRIAL ELECTRONICS (ISCAIE 2021) | 2021年
关键词
software defect prediction; data imbalance; ensemble; feature space partitioning; Genetic Algorithm; Synthetic Minority Oversampling; FEATURE-SELECTION; SMOTE;
D O I
10.1109/ISCAIE51753.2021.9431842
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software defect prediction has been a concurrent topic in software quality-based research. Predictive models that identify defect prone parts of Software can be evolved from defect data and software metrics. Various studies conducted in the past have explored Machine Learning-based approaches for this purpose but the problem of handling imbalanced defect data without compromising on the model's performance remains at large. In this work, we have proposed, compared, and analyzed a hybrid technique, SAGA(SMOTE + AdaSS + Genetic Algorithm), for solving the imbalance problem faced in software defect prediction. SAGA employs ensemble classification based on feature space partitioning in conjunction with the Synthetic Minority Oversampling technique. Various parameters related to feature space partitioning are optimized using the Genetic Algorithm The values of ROC-AUC, G-mean, Balance, and Accuracy obtained on open-source datasets confirm the effectiveness of the proposed technique.
引用
收藏
页码:331 / 336
页数:6
相关论文
共 50 条
  • [1] Class Imbalance Data-Generation for Software Defect Prediction
    Li, Zheng
    Zhang, Xingyao
    Guo, Junxia
    Shang, Ying
    2019 26TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC), 2019, : 276 - 283
  • [2] An Empirical Study on Data Sampling Methods in Addressing Class Imbalance Problem in Software Defect Prediction
    Odejide, Babajide J.
    Bajeh, Amos O.
    Balogun, Abdullateef O.
    Alanamu, Zubair O.
    Adewole, Kayode S.
    Akintola, Abimbola G.
    Salihu, Shakirat A.
    Usman-Hamza, Fatima E.
    Mojeed, Hammed A.
    SOFTWARE ENGINEERING PERSPECTIVES IN SYSTEMS, VOL. 1, 2022, 501 : 594 - 610
  • [3] COSTE: Complexity-based OverSampling TEchnique to alleviate the class imbalance problem in software defect prediction
    Feng, Shuo
    Keung, Jacky
    Yu, Xiao
    Xiao, Yan
    Bennin, Kwabena Ebo
    Kabir, Md Alamgir
    Zhang, Miao
    INFORMATION AND SOFTWARE TECHNOLOGY, 2021, 129
  • [4] Support Vector based Oversampling Technique for Handling Class Imbalance in Software Defect Prediction
    Malhotra, Ruchika
    Agrawal, Vaibhav
    Pal, Vedansh
    Agarwal, Tushar
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 1078 - 1083
  • [5] Software Defect Prediction using Hybrid Approach
    Thant, Myo Wai
    Aung, Nyein Thwet Thwet
    2019 INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION TECHNOLOGIES (ICAIT), 2019, : 262 - 267
  • [6] Class Imbalance Reduction (CIR): A Novel Approach to Software Defect Prediction in the Presence of Class Imbalance
    Bejjanki, Kiran Kumar
    Gyani, Jayadev
    Gugulothu, Narsimha
    SYMMETRY-BASEL, 2020, 12 (03):
  • [7] Credibility Based Imbalance Boosting Method for Software Defect Proneness Prediction
    Tong, Haonan
    Wang, Shihai
    Li, Guangling
    APPLIED SCIENCES-BASEL, 2020, 10 (22): : 1 - 29
  • [8] A Survey of Different Approaches for the Class Imbalance Problem in Software Defect Prediction
    Dar, Abdul Waheed
    Farooq, Sheikh Umar
    INTERNATIONAL JOURNAL OF SOFTWARE SCIENCE AND COMPUTATIONAL INTELLIGENCE-IJSSCI, 2022, 14 (01):
  • [9] Using Class Imbalance Learning for Software Defect Prediction
    Wang, Shuo
    Yao, Xin
    IEEE TRANSACTIONS ON RELIABILITY, 2013, 62 (02) : 434 - 443
  • [10] CFIWSE: A Hybrid Preprocessing Approach for Defect Prediction on Imbalance Real-World Datasets
    Xu, Jiaxi
    Shang, Jingwei
    Huang, Zhichang
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY COMPANION, QRS-C, 2022, : 392 - 401