Improving Defect Localization by Classifying the Affected Asset Using Machine Learning

被引:2
作者
Halali, Sam [1 ]
Staron, Miroslaw [1 ]
Ochodek, Miroslaw [1 ,2 ]
Meding, Wilhelm [3 ]
机构
[1] Chalmers Univ Gothenburg, Gothenburg, Sweden
[2] Poznan Univ Tech, Inst Comp Sci, Poznan, Poland
[3] Ericsson AB, Gothenburg, Sweden
来源
SOFTWARE QUALITY: THE COMPLEXITY AND CHALLENGES OF SOFTWARE ENGINEERING AND SOFTWARE QUALITY IN THE CLOUD | 2019年 / 338卷
关键词
Defect localization; Machine learning; Case study; BUG; ACCURATE;
D O I
10.1007/978-3-030-05767-1_8
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A vital part of a defect's resolution is the task of defect localization. Defect localization is the task of finding the exact location of the defect in the system. The defect report, in particular, the asset attribute, helps the person assigned to handle the problem to limit the search space when investigating the exact location of the defect. However, research has shown that oftentimes reporters initially assign values to these attributes that provide incorrect information. In this paper, we propose and evaluate the way of automatically identifying the location of a defect using machine learning to classify the source asset. By training an Support-Vector-Machine (SVM) classifier with features constructed from both categorical and textual attributes of the defect reports we achieved an accuracy of 58.52% predicting the source asset. However, when we trained an SVM to provide a list of recommendations rather than a single prediction, the recall increased to up to 92.34%. Given these results, we conclude that software development teams can use these algorithms to predict up to ten potential locations, but already with three predicted locations, the teams can get useful results with the accuracy of over 70%.
引用
收藏
页码:106 / 122
页数:17
相关论文
共 22 条
[1]  
[Anonymous], P 11 INT C EV ASS SO
[2]  
[Anonymous], SCIKIT LEARN SCIKIT
[3]  
Anvik J, 2006, P 28 INT C SOFTW ENG, V2006, P361, DOI DOI 10.1145/1134285.1134336
[4]  
Banitaan S, 2013, INT CONF COMM INF T, P215, DOI 10.1109/ICCITechnology.2013.6579552
[5]   Automated, highly-accurate, bug assignment using machine learning and tossing graphs [J].
Bhattacharya, Pamela ;
Neamtiu, Lulian ;
Shelton, Christian R. .
JOURNAL OF SYSTEMS AND SOFTWARE, 2012, 85 (10) :2275-2292
[6]   Speed, Data, and Ecosystems The Future of Software Engineering [J].
Bosch, Jan .
IEEE SOFTWARE, 2016, 33 (01) :82-88
[7]   Challenges and opportunities for software change request repositories: a systematic mapping study [J].
Cavalcanti, Yguarata Cerqueira ;
da Mota Silveira Neto, Paulo Anselmo ;
Machado, Ivan do Carmo ;
Vale, Tassio Ferreira ;
de Almeida, Eduardo Santana ;
de Lemos Meira, Silvio Romero .
JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2014, 26 (07) :620-653
[8]   Machine learning or information retrieval techniques for bug triaging: Which is better? [J].
Goyal, Anjali ;
Sardana, Neetu .
E-Informatica Software Engineering Journal, 2017, 11 (01) :117-141
[9]   Automated Duplicate Detection for Bug Tracking Systems [J].
Jalbert, Nicholas ;
Weimer, Westley .
2008 IEEE INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS & NETWORKS WITH FTCS & DCC, 2008, :52-61
[10]   Automated bug assignment: Ensemble-based machine learning in large scale industrial contexts [J].
Jonsson, Leif ;
Borg, Markus ;
Broman, David ;
Sandahl, Kristian ;
Eldh, Sigrid ;
Runeson, Per .
EMPIRICAL SOFTWARE ENGINEERING, 2016, 21 (04) :1533-1578