Effort-aware semi-supervised just-in-time defect prediction

Cited by: 37
Authors
Li, Weiwei [1]
Zhang, Wenzhou [2]
Jia, Xiuyi [2]
Huang, Zhiqiu [3]
Affiliations
[1] Nanjing Univ Aeronaut & Astronaut, Coll Astronaut, Nanjing 210016, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[3] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
Defect prediction; Just-in-time; Tri-training; Effort-aware; SOFTWARE; MODELS
DOI
10.1016/j.infsof.2020.106364
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Context: Software defect prediction is an important technique that can help practitioners allocate their quality assurance efforts. In recent years, just-in-time (JIT) defect prediction has attracted considerable interest, as it enables developers to identify risky changes at check-in time.
Objective: Many studies have approached JIT defect prediction from supervised and unsupervised perspectives. A model that does not rely on label information would be preferred. However, the unsupervised models proposed in previous studies performed unsatisfactorily in the classification scenario because they lack supervised information. Furthermore, most supervised models fail to outperform simple unsupervised models in the ranking scenario. To overcome these weaknesses, we take a semi-supervised perspective, which requires only a small quantity of labeled data for training.
Method: In this paper, we propose a semi-supervised model for JIT defect prediction named Effort-Aware Tri-Training (EATT), an effort-aware method that uses a greedy strategy to rank changes. We compare EATT with state-of-the-art supervised and unsupervised models at different labeled rates.
Results: The experimental results on six open-source projects demonstrate that EATT outperforms existing supervised and unsupervised models for effort-aware JIT defect prediction, and has similar or superior performance in classifying defect-inducing changes.
Conclusion: The results show that EATT not only achieves classification accuracy as high as that of supervised models, but also offers more practical value than the other compared models in terms of the effort needed to review changes.
Pages: 17
Related papers
50 in total
  • [31] The Impact of Duplicate Changes on Just-in-Time Defect Prediction
    Duan, Ruifeng
    Xu, Haitao
    Fan, Yuanrui
    Yan, Meng
    [J]. IEEE TRANSACTIONS ON RELIABILITY, 2022, 71 (03) : 1294 - 1308
  • [32] The impact of context metrics on just-in-time defect prediction
    Kondo, Masanari
    German, Daniel M.
    Mizuno, Osamu
    Choi, Eun-Hye
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2020, 25 (01) : 890 - 939
  • [33] Fine-grained just-in-time defect prediction
    Pascarella, Luca
    Palomba, Fabio
    Bacchelli, Alberto
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2019, 150 : 22 - 36
  • [34] The impact of context metrics on just-in-time defect prediction
    Masanari Kondo
    Daniel M. German
    Osamu Mizuno
    Eun-Hye Choi
    [J]. Empirical Software Engineering, 2020, 25 : 890 - 939
  • [35] Just-In-Time Defect Prediction on JavaScript Projects: A Replication Study
    Ni, Chao
    Xia, Xin
    Lo, David
    Yang, Xiaohu
    Hassan, Ahmed E.
    [J]. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2022, 31 (04)
  • [36] A multi-objective effort-aware defect prediction approach based on NSGA-II
    Yu, Xiao
    Liu, Liming
    Zhu, Lin
    Keung, Jacky Wai
    Wang, Zijian
    Li, Fuyang
    [J]. APPLIED SOFT COMPUTING, 2023, 149
  • [37] An improved semi-supervised learning method for software defect prediction
    Ma, Ying
    Pan, Weiwei
    Zhu, Shunzhi
    Yin, Huayi
    Luo, Jian
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2014, 27 (05) : 2473 - 2480
  • [38] A Comparison of Semi-Supervised Classification Approaches for Software Defect Prediction
    Catal, Cagatay
    [J]. JOURNAL OF INTELLIGENT SYSTEMS, 2014, 23 (01) : 75 - 82
  • [39] A Replication Study: Just-In-Time Defect Prediction with Ensemble Learning
    Young, Steven
    Abdou, Tamer
    Bener, Ayse
    [J]. 2018 IEEE/ACM 6TH INTERNATIONAL WORKSHOP ON REALIZING ARTIFICIAL INTELLIGENCE SYNERGIES IN SOFTWARE ENGINEERING (RAISE), 2018, : 42 - 47
  • [40] Towards Reliable Online Just-in-Time Software Defect Prediction
    Cabral, George G.
    Minku, Leandro L.
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (03) : 1342 - 1358