On the Effectiveness of Information Retrieval Based Bug Localization for C Programs

Cited by: 26
Authors
Saha, Ripon K. [1]
Lawall, Julia [2]
Khurshid, Sarfraz [1]
Perry, Dewayne E. [1]
Affiliations
[1] Univ Texas Austin, Austin, TX 78712 USA
[2] Univ Paris 04, UPMC, Inria, LIP6, F-75230 Paris 05, France
Source
2014 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME) | 2014
Keywords
Bug Localization; Information Retrieval; Search
DOI
10.1109/ICSME.2014.38
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Localizing bugs is important, difficult, and expensive, especially for large software projects. To address this problem, information retrieval (IR) based bug localization has increasingly been used to suggest potential buggy files given a bug report. To date, researchers have proposed a number of IR techniques for bug localization and empirically evaluated them to understand their effectiveness. However, virtually all of the evaluations have been limited to the projects written in object-oriented programming languages, particularly Java. Therefore, the effectiveness of these techniques for other widely-used languages such as C is still unknown. In this paper, we create a benchmark dataset consisting of more than 7,500 bug reports from five popular C projects and rigorously evaluate our recently introduced IR-based bug localization tool using this dataset. Our results indicate that although the IR-relevant properties of C and Java programs are different, IR-based bug localization in C software at the file level is overall as effective as in Java software. However, we also find that the recent advance of using program structure information in performing bug localization gives less of a benefit for C software than for Java software.
Pages: 161-170
Number of pages: 10