Which bug reports are valid and why? Using the BERT transformer to classify bug reports and explain their validity

被引:0
作者
Meng, Qianru [1 ]
Joost, Visser [1 ]
机构
[1] Leiden Univ, LIACS, Leiden, Netherlands
来源
PROCEEDINGS OF THE 4TH EUROPEAN SYMPOSIUM ON SOFTWARE ENGINEERING, ESSE 2023 | 2024年
关键词
Bug report; BERT; Transformer; Explanation; Deep Learning; Classification;
D O I
10.1145/3651640.3651648
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Defects and other issues regarding quality and performance of software systems are reported and stored in defect-tracking systems. For software developers, classifying valid bug reports in large defect repositories is challenging. Partly, this is because bug reports commonly contain noise and domain-specific terms. Deep learning, with its strong learning ability from complex text, offers a solution to this issue. However, merely focusing on the performance of the model is not sufficient - clarifying the classification decisions of the system holds equal significance. Our method uses BERT to perform the bug report validity classification task, and we test various mechanisms to explain the classification. Through rigorous evaluation on five open source datasets and benchmarking BERT's performance against CNN, we demonstrate significant improvements in recall, precision, and F1 score. Importantly, through the evaluation of results from three explanation techniques, our method effectively identifies key features essential to the validity and quality of bug reports.
引用
收藏
页码:52 / 60
页数:9
相关论文
共 21 条
[1]  
Anvik J, 2005, P 2005 OOPSLA WORKSH, P35, DOI [DOI 10.1145/1117696, DOI 10.1145/1117696.1117704]
[2]   Bug Report Quality Evaluation Considering the Effect of Submitter Reputation [J].
Aversano, Lerina ;
Tedeschi, Ermanno .
ICSOFT-EA: PROCEEDINGS OF THE 11TH INTERNATIONAL JOINT CONFERENCE ON SOFTWARE TECHNOLOGIES - VOL. 1, 2016, :194-201
[3]   An Empirical Analysis of Bug Reports and Bug Fixing in Open Source Android Apps [J].
Bhattacharya, Pamela ;
Ulanova, Liudmila ;
Neamtiu, Iulian ;
Koduru, Sai Charan .
PROCEEDINGS OF THE 17TH EUROPEAN CONFERENCE ON SOFTWARE MAINTENANCE AND REENGINEERING (CSMR 2013), 2013, :133-143
[4]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]  
Chen X, 2018, 2018 25TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING (SANER 2018), P368, DOI 10.1109/SANER.2018.8330224
[7]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[8]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[9]  
Fan X., 2014, Journal of Software Engineering, V4, P1
[10]   Deep Learning Based Valid Bug Reports Determination and Explanation [J].
He, Jianjun ;
Xu, Ling ;
Fan, Yuanrui ;
Xu, Zhou ;
Yan, Meng ;
Lei, Yan .
2020 IEEE 31ST INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE 2020), 2020, :184-194