On the classification of bug reports to improve bug localization

被引:0
作者
Fan Fang
John Wu
Yanyan Li
Xin Ye
Wajdi Aljedaani
Mohamed Wiem Mkaouer
机构
[1] California State University San Marcos,
[2] University of North Texas,undefined
[3] Rochester Institute of Technology,undefined
来源
Soft Computing | 2021年 / 25卷
关键词
Bug classification; Bug localization; Bug report quality; Machine learning;
D O I
暂无
中图分类号
学科分类号
摘要
Bug localization is the automated process of finding the possible faulty files in a software project. Bug localization allows developers to concentrate on vital files. Information retrieval (IR)-based approaches have been proposed to assist automatically identify software defects by using bug report information. However, some bug reports that are not semantically related to the relevant code are not helpful to IR-based systems. Running an IR-based reporting system can lead to false-positive results. In this paper, we propose a classification model for classifying a bug report as either uninformative or informative. Our approach helps to lower false positives and increase ranking performances by filtering uninformative information before running an IR-based bug location system. The model is based on implicit features learned from bug reports that use neural networks and explicit features defined manually. We test our proposed model on three open-source software projects that contain over 9000 bug reports. The results of the evaluation show that our model enhances the efficiency of a developed IR-based system in the trade-off between precision and recall. For implicit features, our tests with comparisons show that the LSTM network performs better than the CNN and multilayer perceptron with respect to the F-measurements. Combining both implicit and explicit features outperforms using only implicit features. Our classification model helps improve precision in bug localization tasks when precision is considered more important than recall.
引用
收藏
页码:7307 / 7323
页数:16
相关论文
共 72 条
[1]  
Anvik J(2011)Reducing the effort of bug report triage: recommenders for development-oriented decisions ACM Trans Softw Eng Methodol 20 1-10
[2]  
Murphy GC(1994)Learning long-term dependencies with gradient descent is difficult Trans Neural Netw 5 157-166
[3]  
Bengio Y(2012)Automated, highly-accurate, bug assignment using machine learning and tossing graphs J Syst Softw 85 2275-2292
[4]  
Simard P(2019)Using bug descriptions to reformulate queries during text-retrieval-based bug localization Empir Softw Eng 24 2947-3007
[5]  
Frasconi P(2018)A deep learning model for estimating story points IEEE Trans Softw Eng 18 277-309
[6]  
Bhattacharya P(2013)Integrating information retrieval, execution and link analysis algorithms to improve feature location in software Empir Softw Engg 45 1002-1023
[7]  
Neamtiu I(2018)Network-clustered multi-modal bug localization IEEE Trans Softw Eng 9 1735-1780
[8]  
Shelton CR(1997)Long short-term memory Neural Comput 2 359-366
[9]  
Chaparro O(1989)Multilayer feedforward networks are universal approximators Neural Netw 93 45-57
[10]  
Florez JM(2018)Just enough semantics: an information theoretic approach for IR-based software bug localization Inf Softw Technol 39 1597-1610