Online cross-project approach with project-level similarity for just-in-time software defect prediction

被引:0
作者
Teng, Cong [1 ,2 ]
Song, Liyan [3 ]
Yao, Xin [4 ]
机构
[1] Southern Univ Sci & Technol, Res Inst Trustworthy Autonomous Syst, Shenzhen, Peoples R China
[2] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Guangdong Prov Key Lab Brain Inspired Intelligent, Shenzhen, Peoples R China
[3] Harbin Inst Technol, Fac Comp, Harbin, Peoples R China
[4] Lingnan Univ, Sch Data Sci, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Just-in-time software defect prediction; Cross-project; Online learning; Project-level similarity; Verification latency;
D O I
10.1007/s10664-024-10551-8
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The adoption of additional Other Project (OP) data has shown to be effective for online Just-In-Time Software Defect Prediction (JIT-SDP). However, state-of-the-art online Cross-Project (CP) methods, such as All-In-One (AIO) and Filtering, which operate at the data-level, encounter the difficulties in balancing diversity and validity of the selected OP data, which can negatively impact predictive performance. AIO may select unrelated OP data, resulting in a lack of validity, while Filtering tends to select OP data that closely resemble Target Project (TP) data, leading to a lack of diversity. To address this validity-vs-diversity challenge, a promising approach is to utilize an online project-level OP selection methodology. This approach selects instructive other projects that exhibit similarities to TP and can positively impact predictive performance, achieving better data validity compared to AIO and maintaining higher diversity compared to Filtering. To accomplish this, we propose a project-level Cross-Project method with Similarity (CroPS), which employs appropriate project-level similarity metrics to identify instructive other projects for model updating over time. CroPS applies a specified threshold to determine the selection of other projects at any given moment. Furthermore, we propose an ensemble-like framework called Multi-threshold CroPS (Multi-CroPS), which incorporates multiple threshold options for selecting other projects and poses the importance of defect-inducing changes. Experimental results based on 23 open-source projects validate the effectiveness of our project-level metrics for calculating similarities between projects. The results also demonstrate that CroPS significantly enhances the predictive performance while reducing computational costs compared to existing data-level CP approaches. Moreover, Multi-CroPS achieves significantly better performance than state-of-the-art CP approaches including our CroPS.
引用
收藏
页数:53
相关论文
共 48 条
[1]   PR-SZZ: How pull requests can support the tracing of defects in software repositories [J].
Bludau, Peter ;
Pretschner, Alexander .
2022 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING (SANER 2022), 2022, :1-12
[2]   An investigation of online and offline learning models for online Just-in-Time Software Defect Prediction [J].
Cabral, George G. ;
Minku, Leandro L. ;
Oliveira, Adriano L. I. ;
Pessoa, Dinaldo A. ;
Tabassum, Sadia .
EMPIRICAL SOFTWARE ENGINEERING, 2023, 28 (05)
[3]   Towards Reliable Online Just-in-Time Software Defect Prediction [J].
Cabral, George G. ;
Minku, Leandro L. .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (03) :1342-1358
[4]   Class Imbalance Evolution and Verification Latency in Just-in-Time Software Defect Prediction [J].
Cabral, George G. ;
Minku, Leandro L. ;
Shihab, Emad ;
Mujahid, Suhaib .
2019 IEEE/ACM 41ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2019), 2019, :666-676
[5]   The Matthews Correlation Coefficient (MCC) is More Informative Than Cohen's Kappa and Brier Score in Binary Classification Assessment [J].
Chicco, Davide ;
Warrens, Matthijs J. ;
Jurman, Giuseppe .
IEEE ACCESS, 2021, 9 :78368-78381
[6]   The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation [J].
Chicco, Davide ;
Jurman, Giuseppe .
BMC GENOMICS, 2020, 21 (01)
[7]  
Cho Y., 2018, 6 INT WORKSH QUANT A, P2
[8]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46
[9]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[10]   Software contributions to aircraft adverse events: Case studies and analyses of recurrent accident patterns and failure mechanisms [J].
Favaro, Francesca M. ;
Jackson, David W. ;
Saleh, Joseph H. ;
Mavris, Dimitri N. .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2013, 113 :131-142