Effort-Aware semi-Supervised just-in-Time defect prediction

被引:37
作者
Li, Weiwei [1 ]
Zhang, Wenzhou [2 ]
Jia, Xiuyi [2 ]
Huang, Zhiqiu [3 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Astronaut, Nanjing 210016, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[3] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Defect prediction; Just-in-time; Tri-training; Effort-aware; SOFTWARE; MODELS;
D O I
10.1016/j.infsof.2020.106364
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Context: Software defect prediction is an important technique that can help practitioners allocate their quality assurance efforts. In recent years, just-in-time (JIT) defect prediction has attracted considerable interest, as it enables developers to identify risky changes at check-in time. Objective: Many studies have conducted research from supervised and unsupervised perspectives. A model that does not rely on label information would be preferred. However, the performance of unsupervised models proposed by previous studies in the classification scenario was unsatisfactory due to the lack of supervised information. Furthermore, most supervised models fail to outperform simple unsupervised models in the ranking scenario. To overcome this weakness, we conduct research from the semi-supervised perspective that only requires a small quantity of labeled data for training. Method: In this paper, we propose a semi-supervised model for JIT defect prediction named Effort-Aware TriTraining (EATT), which is an effort-aware method using a greedy strategy to rank changes. We compare EATT with the state-of-the-art supervised and unsupervised models with respect to different labeled rate. Results: The experimental results on six open-source projects demonstrate that EATT outperforms existing supervised and unsupervised models for effort-aware JIT defect prediction, and has similar or superior performance in classifying defect-inducing changes. Conclusion: The results show that EATT can not only achieve high classification accuracy as supervised models, but also offer more practical value than other compared models from the perspective of the effort needed to review changes.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Finding the best learning to rank algorithms for effort-aware defect prediction
    Yu, Xiao
    Dai, Heng
    Li, Li
    Gu, Xiaodong
    Keung, Jacky Wai
    Bennin, Kwabena Ebo
    Li, Fuyang
    Liu, Jin
    INFORMATION AND SOFTWARE TECHNOLOGY, 2023, 157
  • [22] Empirical Evaluation of Cross-Release Effort-Aware Defect Prediction Models
    Bennin, Kwabena Ebo
    Toda, Koji
    Kamei, Yasutaka
    Keung, Jacky
    Monden, Akito
    Ubayashi, Naoyasu
    2016 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2016), 2016, : 214 - 221
  • [23] MVSE: Effort-Aware Heterogeneous Defect Prediction via Multiple-View Spectral Embedding
    Xu, Zhou
    Ye, Sizhe
    Zhang, Tao
    Xia, Zhen
    Pang, Shuai
    Wang, Yong
    Tang, Yutian
    2019 IEEE 19TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2019), 2019, : 10 - 17
  • [24] Improving classifier-based effort-aware software defect prediction by reducing ranking errors
    Guo, Yuchen
    Shepperd, Martin
    Li, Ning
    PROCEEDINGS OF 2024 28TH INTERNATION CONFERENCE ON EVALUATION AND ASSESSMENT IN SOFTWARE ENGINEERING, EASE 2024, 2024, : 160 - 169
  • [25] Bug numbers matter: An empirical study of effort-aware defect prediction using class labels versus bug numbers
    Yang, Peixin
    Zeng, Ziyao
    Zhu, Lin
    Zhang, Yanjiao
    Wang, Xin
    Ma, Chuanxiang
    Hu, Wenhua
    SOFTWARE-PRACTICE & EXPERIENCE, 2025, 55 (01) : 49 - 78
  • [26] Leveraging developer information for efficient effort-aware bug prediction
    Qu, Yu
    Chi, Jianlei
    Yin, Heng
    INFORMATION AND SOFTWARE TECHNOLOGY, 2021, 137
  • [27] Empirical analysis of network measures for effort-aware fault-proneness prediction
    Ma, Wanwangying
    Chen, Lin
    Yang, Yibiao
    Zhou, Yuming
    Xu, Baowen
    INFORMATION AND SOFTWARE TECHNOLOGY, 2016, 69 : 50 - 70
  • [28] The Impact of Duplicate Changes on Just-in-Time Defect Prediction
    Duan, Ruifeng
    Xu, Haitao
    Fan, Yuanrui
    Yan, Meng
    IEEE TRANSACTIONS ON RELIABILITY, 2022, 71 (03) : 1294 - 1308
  • [29] ApacheJIT: A Large Dataset for Just-In-Time Defect Prediction
    Keshavarz, Hossein
    Nagappan, Meiyappan
    2022 MINING SOFTWARE REPOSITORIES CONFERENCE (MSR 2022), 2022, : 191 - 195
  • [30] The impact of context metrics on just-in-time defect prediction
    Kondo, Masanari
    German, Daniel M.
    Mizuno, Osamu
    Choi, Eun-Hye
    EMPIRICAL SOFTWARE ENGINEERING, 2020, 25 (01) : 890 - 939