The forgotten role of search queries in IR-based bug localization: an empirical study

Authors
Mohammad Masudur Rahman
Foutse Khomh
Shamima Yeasmin
Chanchal K. Roy
Affiliations
[1] Dalhousie University
[2] Polytechnique Montréal
[3] University of Saskatchewan
Source
Empirical Software Engineering | 2021 / Vol. 26
Keywords
Debugging automation; Bug localization; Information retrieval; Natural language processing; Query construction; Keyword selection; Genetic algorithm; Optimal search query; Poor search query; Empirical study;
DOI
Not available
Abstract
Being lightweight and cost-effective, IR-based approaches for bug localization have shown promise in finding software bugs. However, the accuracy of these approaches depends heavily on the bug reports they use. A significant number of bug reports contain only plain natural language text. According to existing studies, IR-based approaches cannot perform well when they use these bug reports as search queries. On the other hand, recent evidence suggests that even these natural language-only reports contain enough good keywords to localize the bugs successfully. On one hand, these findings suggest that natural language-only bug reports might be a sufficient source of good query keywords. On the other hand, they cast serious doubt on the query selection practices in IR-based bug localization. In this article, we attempt to clarify this aspect by conducting an in-depth empirical study that critically examines the state-of-the-art query selection practices in IR-based bug localization. In particular, we use a dataset of 2,320 bug reports, employ ten existing approaches from the literature, exploit a Genetic Algorithm-based approach to construct optimal and near-optimal search queries from these bug reports, and then answer three research questions. We confirm that the state-of-the-art query construction approaches are indeed not sufficient for constructing appropriate queries (for bug localization) from certain natural language-only bug reports. However, these bug reports do contain high-quality search keywords in their texts even though they might not contain explicit hints for localizing bugs (e.g., stack traces). We also demonstrate that optimal queries and non-optimal queries chosen from bug report texts are significantly different in terms of several keyword characteristics (e.g., frequency, entropy, position, part of speech). Such an analysis has led us to four actionable insights on how to choose appropriate keywords from a bug report. Furthermore, we demonstrate a 27%–34% improvement in the performance of non-optimal queries through the application of our actionable insights to them. Finally, we summarize our study findings with future research directions (e.g., machine intelligence in keyword selection).