Effort-Aware semi-Supervised just-in-Time defect prediction

被引：37

作者：

Li, Weiwei ^{[1
]}

Zhang, Wenzhou ^{[2
]}

Jia, Xiuyi ^{[2
]}

Huang, Zhiqiu ^{[3
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Astronaut, Nanjing 210016, Peoples R China

[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China

[3] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China

来源：

INFORMATION AND SOFTWARE TECHNOLOGY | 2020年 / 126卷

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Defect prediction; Just-in-time; Tri-training; Effort-aware; SOFTWARE; MODELS;

D O I：

10.1016/j.infsof.2020.106364

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Context: Software defect prediction is an important technique that can help practitioners allocate their quality assurance efforts. In recent years, just-in-time (JIT) defect prediction has attracted considerable interest, as it enables developers to identify risky changes at check-in time. Objective: Many studies have conducted research from supervised and unsupervised perspectives. A model that does not rely on label information would be preferred. However, the performance of unsupervised models proposed by previous studies in the classification scenario was unsatisfactory due to the lack of supervised information. Furthermore, most supervised models fail to outperform simple unsupervised models in the ranking scenario. To overcome this weakness, we conduct research from the semi-supervised perspective that only requires a small quantity of labeled data for training. Method: In this paper, we propose a semi-supervised model for JIT defect prediction named Effort-Aware TriTraining (EATT), which is an effort-aware method using a greedy strategy to rank changes. We compare EATT with the state-of-the-art supervised and unsupervised models with respect to different labeled rate. Results: The experimental results on six open-source projects demonstrate that EATT outperforms existing supervised and unsupervised models for effort-aware JIT defect prediction, and has similar or superior performance in classifying defect-inducing changes. Conclusion: The results show that EATT can not only achieve high classification accuracy as supervised models, but also offer more practical value than other compared models from the perspective of the effort needed to review changes.

引用

页数：17

共 50 条

[21] Finding the best learning to rank algorithms for effort-aware defect prediction
Yu, Xiao
Dai, Heng
Li, Li
Gu, Xiaodong
Keung, Jacky Wai
Bennin, Kwabena Ebo
Li, Fuyang
Liu, Jin
INFORMATION AND SOFTWARE TECHNOLOGY, 2023, 157
[22] Empirical Evaluation of Cross-Release Effort-Aware Defect Prediction Models
Bennin, Kwabena Ebo
Toda, Koji
Kamei, Yasutaka
Keung, Jacky
Monden, Akito
Ubayashi, Naoyasu
2016 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2016), 2016, : 214 - 221
[23] MVSE: Effort-Aware Heterogeneous Defect Prediction via Multiple-View Spectral Embedding
Xu, Zhou
Ye, Sizhe
Zhang, Tao
Xia, Zhen
Pang, Shuai
Wang, Yong
Tang, Yutian
2019 IEEE 19TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2019), 2019, : 10 - 17
[24] Improving classifier-based effort-aware software defect prediction by reducing ranking errors
Guo, Yuchen
Shepperd, Martin
Li, Ning
PROCEEDINGS OF 2024 28TH INTERNATION CONFERENCE ON EVALUATION AND ASSESSMENT IN SOFTWARE ENGINEERING, EASE 2024, 2024, : 160 - 169
[25] Bug numbers matter: An empirical study of effort-aware defect prediction using class labels versus bug numbers
Yang, Peixin
Zeng, Ziyao
Zhu, Lin
Zhang, Yanjiao
Wang, Xin
Ma, Chuanxiang
Hu, Wenhua
SOFTWARE-PRACTICE & EXPERIENCE, 2025, 55 (01) : 49 - 78
[26] Leveraging developer information for efficient effort-aware bug prediction
Qu, Yu
Chi, Jianlei
Yin, Heng
INFORMATION AND SOFTWARE TECHNOLOGY, 2021, 137
[27] Empirical analysis of network measures for effort-aware fault-proneness prediction
Ma, Wanwangying
Chen, Lin
Yang, Yibiao
Zhou, Yuming
Xu, Baowen
INFORMATION AND SOFTWARE TECHNOLOGY, 2016, 69 : 50 - 70
[28] The Impact of Duplicate Changes on Just-in-Time Defect Prediction
Duan, Ruifeng
Xu, Haitao
Fan, Yuanrui
Yan, Meng
IEEE TRANSACTIONS ON RELIABILITY, 2022, 71 (03) : 1294 - 1308
[29] ApacheJIT: A Large Dataset for Just-In-Time Defect Prediction
Keshavarz, Hossein
Nagappan, Meiyappan
2022 MINING SOFTWARE REPOSITORIES CONFERENCE (MSR 2022), 2022, : 191 - 195
[30] The impact of context metrics on just-in-time defect prediction
Kondo, Masanari
German, Daniel M.
Mizuno, Osamu
Choi, Eun-Hye
EMPIRICAL SOFTWARE ENGINEERING, 2020, 25 (01) : 890 - 939

← 1 2 3 4 5 →