Improving IR-Based Bug Localization with Context-Aware Query Reformulation

被引:77
作者
Rahman, Mohammad Masudur [1 ]
Roy, Chanchal K. [1 ]
机构
[1] Univ Saskatchewan, Saskatoon, SK, Canada
来源
ESEC/FSE'18: PROCEEDINGS OF THE 2018 26TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING | 2018年
基金
加拿大自然科学与工程研究理事会;
关键词
Debugging automation; bug localization; bug report quality; query reformulation; information retrieval; graph-based term weighting; FAULT LOCALIZATION;
D O I
10.1145/3236024.3236065
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recent findings suggest that Information Retrieval (IR)-based bug localization techniques do not perform well if the bug report lacks rich structured information (e.g., relevant program entity names). Conversely, excessive structured information (e.g., stack traces) in the bug report might not always help the automated localization either. In this paper, we propose a novel technique-BLIZZARD-that automatically localizes buggy entities from project source using appropriate query reformulation and effective information retrieval. In particular, our technique determines whether there are excessive program entities or not in a bug report (query), and then applies appropriate reformulations to the query for bug localization. Experiments using 5,139 bug reports show that our technique can localize the buggy source documents with 7%-56% higher Hit@10, 6%-62% higher MAP@10 and 6%-62% higher MRR@10 than the baseline technique. Comparison with the state-of-the-art techniques and their variants report that our technique can improve 19% in MAP@10 and 20% in MRR@10 over the state-of-the-art, and can improve 59% of the noisy queries and 39% of the poor queries.
引用
收藏
页码:621 / 632
页数:12
相关论文
共 68 条
[41]  
Rahman MM, 2017, IEEE INT CONF AUTOM, P428, DOI 10.1109/ASE.2017.8115655
[42]  
Rahman MM, 2017, 2017 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER), P79, DOI 10.1109/SANER.2017.7884611
[43]   QUICKAR: Automatic Query Reformulation for Concept Location using Crowdsourced Knowledge [J].
Rahman, Mohammad Masudur ;
Roy, Chanchal K. .
2016 31ST IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE), 2016, :220-225
[44]  
Rao S., 2011, 8 WORK C MIN SOFTW R, P43, DOI 10.1145/1985441.1985451
[45]  
Rastkar S., 2010, Proc. 32nd Int'l Conf. Software Eng. (ICSE 10), P505, DOI 10.1145
[46]  
Rigby PC, 2013, PROCEEDINGS OF THE 35TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2013), P832, DOI 10.1109/ICSE.2013.6606629
[47]   ON TERM SELECTION FOR QUERY EXPANSION [J].
ROBERTSON, SE .
JOURNAL OF DOCUMENTATION, 1990, 46 (04) :359-364
[48]  
Rocchio J. J., SMART RETRIEVAL SYST, P313
[49]   On the Effectiveness of Information Retrieval Based Bug Localization for C Programs [J].
Saha, Ripon K. ;
Lawall, Julia ;
Khurshid, Sarfraz ;
Perry, Dewayne E. .
2014 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME), 2014, :161-170
[50]  
Saha RK, 2013, IEEE INT CONF AUTOM, P345, DOI 10.1109/ASE.2013.6693093