Bridging expert knowledge with deep learning techniques for just-in-time defect prediction

Authors
Zhou, Xin [1]
Han, Donggyun [2]
Lo, David [1]
Affiliations
[1] Singapore Management Univ, Sch Comp & Informat Syst, 80 Stamford Rd, Singapore, Singapore
[2] Univ London, Dept Comp Sci Royal Holloway, Egham TW20 0EX, England
Funding
National Research Foundation, Singapore;
Keywords
Just-in-time defect prediction; Expert knowledge; Deep learning; Multi-modal fusion; FUSION METHODS; BUGS;
DOI
10.1007/s10664-024-10591-0
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification codes
081202; 0835;
Abstract
Just-In-Time (JIT) defect prediction aims to automatically predict whether a commit is defective or not, and has been widely studied in recent years. In general, most studies can be classified into two categories: 1) simple models using traditional machine learning classifiers with hand-crafted features, and 2) complex models using deep learning techniques to automatically extract features from commit contents. Hand-crafted features used by simple models are based on expert knowledge but may not fully represent the semantic meaning of the commits. On the other hand, deep learning-based features used by complex models represent the semantic meaning of commits but may not reflect useful expert knowledge. Simple models and complex models seem complementary to each other to some extent. To utilize the advantages of both simple and complex models, we propose a model fusion framework that adopts both early fusions on the feature level and late fusions on the decision level. We propose SimCom++ by adopting the best early and late fusion strategies. The experimental results show that SimCom++ can significantly outperform the baselines by 5.7-26.9%. In addition, our experimental results confirm that the simple model and complex model are complementary to each other.
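The two fusion levels named in the abstract can be illustrated with a minimal sketch. This is not the SimCom++ implementation: the function names, the equal default weight, and the 0.5 threshold are assumptions chosen for illustration, and the abstract does not say which early or late fusion strategies the authors selected. Early fusion is shown here as plain concatenation of hand-crafted and learned feature vectors, and late fusion as a weighted average of the two models' predicted defect probabilities.

```python
from typing import List, Sequence


def early_fusion(expert_features: Sequence[float],
                 semantic_features: Sequence[float]) -> List[float]:
    """Feature-level fusion (illustrative): concatenate hand-crafted
    expert features with features learned from the commit contents."""
    return list(expert_features) + list(semantic_features)


def late_fusion(p_simple: float, p_complex: float, weight: float = 0.5) -> float:
    """Decision-level fusion (illustrative): weighted average of the defect
    probabilities predicted by the simple and the complex model.
    `weight` is a hypothetical parameter, not taken from the paper."""
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must be in [0, 1]")
    return weight * p_simple + (1.0 - weight) * p_complex


def is_defective(probability: float, threshold: float = 0.5) -> bool:
    """Turn a fused probability into a binary prediction (assumed threshold)."""
    return probability >= threshold


if __name__ == "__main__":
    fused_features = early_fusion([1.0, 2.0], [0.5])
    fused_probability = late_fusion(0.9, 0.7)
    print(fused_features, fused_probability, is_defective(fused_probability))
```

In a real pipeline the concatenated vector would feed a single classifier, while the averaged probability combines two separately trained classifiers; the sketch only shows where each kind of fusion sits.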
Pages: 44