Combat with Class Overlapping in Software Defect Prediction Using Neighbourhood Metric

被引:0
作者
Gupta S. [1 ]
Richa [2 ]
Kumar R. [3 ,4 ]
Jain K.L. [3 ,4 ]
机构
[1] School of Computer Science Engineering, Vellore Institute of Technology, Chennai
[2] Department of Computer science and Engineering, Birla Institute of Technology, Mesra, Ranchi
[3] School of Electronics Engineering, Vellore Institute of Technology, Chennai
[4] School of Computer & Communication Engineering, Manipal University Jaipur, Jaipur
关键词
AUC; Class imbalance; Class overlap; G-mean; Recall; Software defect prediction;
D O I
10.1007/s42979-023-02082-8
中图分类号
学科分类号
摘要
The characteristics of data is a open problem which has been tended perceived in data analysis in machine learning research from last decades. The researcher defined some measures to identify the characteristics of the dataset by applying data complexity measures to find the fitness for purpose. The presence of class overlapping in data-sets, significantly affect performance of the classifiers. Data complexity measures provide quantitative insight in quality of the data set and overlapping existent in it. Machine learning techniques are also utilized by several researchers on healthcare datasets in software defect prediction. In this paper, our aim is to evaluates the effectiveness of new overlap measure: Near Enemy Ratio, and its effect on complexity measures and performance of the classifier. The new ration is based on nearest instances to the target instance. The experimental result offers insights in usefulness of the method and help us decide whether this solution should be applied on a particular data-set or not. © 2023, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.
引用
收藏
相关论文
共 50 条
  • [41] COSTE: Complexity-based OverSampling TEchnique to alleviate the class imbalance problem in software defect prediction
    Feng, Shuo
    Keung, Jacky
    Yu, Xiao
    Xiao, Yan
    Bennin, Kwabena Ebo
    Kabir, Md Alamgir
    Zhang, Miao
    INFORMATION AND SOFTWARE TECHNOLOGY, 2021, 129
  • [42] Class Balancing Approaches in Dataset for Software Defect Prediction: A Systematic Literature Review
    Olvera-Villeda, Dan Javier
    Sanchez-Garcia, Angel J.
    Limon, Xavier
    Dominguez Isidro, Saul
    2023 11TH INTERNATIONAL CONFERENCE IN SOFTWARE ENGINEERING RESEARCH AND INNOVATION, CONISOFT 2023, 2023, : 236 - 245
  • [43] Cross-Project Software Defect Prediction Based on Class Code Similarity
    Wen, Wanzhi
    Shen, Chenqiang
    Lu, Xiaohong
    Li, Zhixian
    Wang, Haoren
    Zhang, Ruinian
    Zhu, Ningbo
    IEEE ACCESS, 2022, 10 : 105485 - 105495
  • [44] Which type of metrics are useful to deal with class imbalance in software defect prediction?
    Ozturk, Muhammed Maruf
    INFORMATION AND SOFTWARE TECHNOLOGY, 2017, 92 : 17 - 29
  • [45] Software Defect Prediction using Oversampling Algorithm: A-SUWO
    Choirunnisa, Shabrina
    Meidyani, Biandina
    Rochimah, Siti
    2018 ELECTRICAL POWER, ELECTRONICS, COMMUNICATIONS, CONTROLS, AND INFORMATICS SEMINAR (EECCIS), 2018, : 337 - 341
  • [46] Software defect prediction using artificial immune recognition system
    Catal, Cagatay
    Diri, Banu
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, 2007, : 285 - +
  • [47] Software Defect Prediction Using SMOTE and Artificial Neural Network
    Dipa, Wisnu Arya
    Sunindyo, Wikan Danar
    PROCEEDINGS OF 2021 INTERNATIONAL CONFERENCE ON DATA AND SOFTWARE ENGINEERING (ICODSE): DATA AND SOFTWARE ENGINEERING FOR SUPPORTING SUSTAINABLE DEVELOPMENT GOALS, 2021,
  • [48] An Empirical Study on Software Defect Prediction Using CodeBERT Model
    Pan, Cong
    Lu, Minyan
    Xu, Biao
    APPLIED SCIENCES-BASEL, 2021, 11 (11):
  • [49] Software Defect Prediction Using Dynamic Support Vector Machine
    Shuai, Bo
    Li, Haifeng
    Li, Mengjun
    Zhang, Quan
    Tang, Chaojing
    2013 9TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2013, : 260 - 263
  • [50] Software Defect Prediction Analysis Using Machine Learning Techniques
    Khalid, Aimen
    Badshah, Gran
    Ayub, Nasir
    Shiraz, Muhammad
    Ghouse, Mohamed
    SUSTAINABILITY, 2023, 15 (06)