Search-Based Duplicate Defect Detection: An Industrial Experience

被引:0
作者
Amoui, Mehdi [1 ]
Kaushik, Nilam [1 ]
Al-Dabbagh, Abraham [1 ]
Tahvildari, Ladan [1 ]
Li, Shimin [2 ]
Liu, Weining [2 ]
机构
[1] Univ Waterloo, Waterloo, ON, Canada
[2] BlackBerry Inc, Waterloo, ON, Canada
来源
2013 10TH IEEE WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR) | 2013年
关键词
Duplicate Defect Detection; Parameter Tuning; Search-based Software Engineering; Information Retrieval;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Duplicate defects put extra overheads on software organizations, as the cost and effort of managing duplicate defects are mainly redundant. Due to the use of natural language and various ways to describe a defect, it is usually hard to investigate duplicate defects automatically. This problem is more severe in large software organizations with huge defect repositories and massive number of defect reporters. Ideally, an efficient tool should prevent duplicate reports from reaching developers by automatically detecting and/or filtering duplicates. It also should be able to offer defect triagers a list of top-N similar bug reports and allow them to compare the similarity of incoming bug reports with the suggested duplicates. This demand has motivated us to design and develop a search-based duplicate bug detection framework at BlackBerry. The approach follows a generalized process model to evaluate and tune the performance of the system in a systematic way. We have applied the framework on software projects at BlackBerry, in addition to the Mozilla defect repository. The experimental results exhibit the performance of the developed framework and highlight the high impact of parameter tuning on its performance.
引用
收藏
页码:173 / 182
页数:10
相关论文
共 24 条
[1]  
Nguyen AT, 2012, IEEE INT CONF AUTOM, P70, DOI 10.1145/2351676.2351687
[2]  
[Anonymous], 2012, P SPAN C INF RETR
[3]  
[Anonymous], 2008, Introduction to information retrieval
[4]  
Ashok B, 2009, 7TH JOINT MEETING OF THE EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND THE ACM SIGSOFT SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, P373
[5]   Duplicate Bug Reports Considered Harmful ... Really? [J].
Bettenburg, Nicolas ;
Premraj, Rahul ;
Zimmermann, Thomas ;
Kim, Sunghun .
2008 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE, 2008, :337-345
[6]   An Initial Study on the Bug Report Duplication Problem [J].
Cavalcanti, Yguarata Cerqueira ;
de Almeida, Eduardo Santana ;
Albuquerque da Cunha, Carlos Eduardo ;
Lucredio, Daniel ;
de Lemos Meira, Silvio Romero .
14TH EUROPEAN CONFERENCE ON SOFTWARE MAINTENANCE AND REENGINEERING (CSMR 2010), 2010, :264-269
[7]  
Chengxiang Zhai, 2001, SIGIR Forum, P334
[8]  
Eiben AE, 2007, STUD COMPUT INTELL, V54, P19
[9]  
Eiben A.E., 2012, Autonomous Search, P15, DOI DOI 10.1007/978-3-642-21434-92
[10]  
Gospodnetic O., 2010, LUCENE ACTION, V2nd