Combat with Class Overlapping in Software Defect Prediction Using Neighbourhood Metric

被引:0
作者
Gupta S. [1 ]
Richa [2 ]
Kumar R. [3 ,4 ]
Jain K.L. [3 ,4 ]
机构
[1] School of Computer Science Engineering, Vellore Institute of Technology, Chennai
[2] Department of Computer science and Engineering, Birla Institute of Technology, Mesra, Ranchi
[3] School of Electronics Engineering, Vellore Institute of Technology, Chennai
[4] School of Computer & Communication Engineering, Manipal University Jaipur, Jaipur
关键词
AUC; Class imbalance; Class overlap; G-mean; Recall; Software defect prediction;
D O I
10.1007/s42979-023-02082-8
中图分类号
学科分类号
摘要
The characteristics of data is a open problem which has been tended perceived in data analysis in machine learning research from last decades. The researcher defined some measures to identify the characteristics of the dataset by applying data complexity measures to find the fitness for purpose. The presence of class overlapping in data-sets, significantly affect performance of the classifiers. Data complexity measures provide quantitative insight in quality of the data set and overlapping existent in it. Machine learning techniques are also utilized by several researchers on healthcare datasets in software defect prediction. In this paper, our aim is to evaluates the effectiveness of new overlap measure: Near Enemy Ratio, and its effect on complexity measures and performance of the classifier. The new ration is based on nearest instances to the target instance. The experimental result offers insights in usefulness of the method and help us decide whether this solution should be applied on a particular data-set or not. © 2023, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.
引用
收藏
相关论文
共 50 条
  • [31] Effective software defect prediction using support vector machines (SVMs)
    Somya Goyal
    International Journal of System Assurance Engineering and Management, 2022, 13 : 681 - 696
  • [32] A Survey of Different Approaches for the Class Imbalance Problem in Software Defect Prediction
    Dar, Abdul Waheed
    Farooq, Sheikh Umar
    INTERNATIONAL JOURNAL OF SOFTWARE SCIENCE AND COMPUTATIONAL INTELLIGENCE-IJSSCI, 2022, 14 (01):
  • [33] An Empirical Study on Data Sampling Methods in Addressing Class Imbalance Problem in Software Defect Prediction
    Odejide, Babajide J.
    Bajeh, Amos O.
    Balogun, Abdullateef O.
    Alanamu, Zubair O.
    Adewole, Kayode S.
    Akintola, Abimbola G.
    Salihu, Shakirat A.
    Usman-Hamza, Fatima E.
    Mojeed, Hammed A.
    SOFTWARE ENGINEERING PERSPECTIVES IN SYSTEMS, VOL. 1, 2022, 501 : 594 - 610
  • [34] The impact of the distance metric and measure on SMOTE-based techniques in software defect prediction
    Feng, Shuo
    Keung, Jacky
    Zhang, Peichang
    Xiao, Yan
    Zhang, Miao
    INFORMATION AND SOFTWARE TECHNOLOGY, 2022, 142
  • [35] Software Defect Prediction using Convolutional Neural Network
    Wongpheng, Kittisak
    Visutsak, Porawat
    35TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2020), 2020, : 240 - 243
  • [36] Software Defect Prediction Using Augmented Bayesian Networks
    Muthukumaran, K.
    Srinivas, Suri
    Malapati, Aruna
    Neti, Lalita Bhanu Murthy
    PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR 2016), 2018, 614 : 279 - 293
  • [37] Software defect prediction using a bidirectional LSTM network combined with oversampling techniques
    Khleel, Nasraldeen Alnor Adam
    Nehez, Karoly
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (03): : 3615 - 3638
  • [38] Influence Analysis Method of Class Imbalance on Software Defect Prediction Model Stability and Prediction Performance
    Zhang Y.-M.
    Zhi S.-L.
    Jiang S.-J.
    Yuan G.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (08): : 2076 - 2087
  • [39] An Empirical Study on Software Defect Prediction Using Over-Sampling by SMOTE
    Pak, Cholmyong
    Wang, Tian Tian
    Su, Xiao Hong
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2018, 28 (06) : 811 - 830
  • [40] Holistic Parameter Optimization for Software Defect Prediction
    Lee, Jaewook
    Choi, Jiwon
    Ryu, Duksan
    Kim, Suntae
    IEEE ACCESS, 2022, 10 : 106781 - 106797