Comparison and evaluation of clone detection tools

被引:454
作者
Bellon, Stefan
Koschke, Rainer
Antoniol, Giuliano
Krinke, Jens
Merlo, Ettore
机构
[1] Axiv GmbH, D-70569 Stuttgart, Germany
[2] Univ Bremen, D-28334 Bremen, Germany
[3] Ecole Polytech, Dept Genie Informat, Montreal, PQ H3T 1J4, Canada
[4] Fernuniv, D-58097 Hagen, Germany
[5] Ecole Polytech, Dept Comp Engn, Montreal, PQ H3C 3A7, Canada
关键词
code; duplicated code; software clones;
D O I
10.1109/TSE.2007.70725
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many techniques for detecting duplicated source code (software clones) have been proposed in the past. However, it is not yet clear how these techniques compare in terms of recall and precision as well as space and time requirements. This paper presents an experiment that evaluates six clone detectors based on eight large C and Java programs (altogether almost 850 KLOC). Their clone candidates were evaluated by one of the authors as an independent third party. The selected techniques cover the whole spectrum of the state-of-the-art in clone detection. The techniques work on text, lexical and syntactic information, software metrics, and program dependency graphs.
引用
收藏
页码:577 / 591
页数:15
相关论文
共 38 条
[1]  
[Anonymous], 1994, P 1994 C CTR ADV STU
[2]  
BAILEY J, 2002, P 2 IEEE INT WORKSH, P36
[3]  
BAKER BS, 1995, SECOND WORKING CONFERENCE ON REVERSE ENGINEERING, PROCEEDINGS, P86, DOI 10.1109/WCRE.1995.514697
[4]  
BAKER BS, 1992, COMPUTING SCIENCE AND STATISTICS : VOL 24, P49
[5]   Parameterized pattern matching: Algorithms and applications [J].
Baker, BS .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1996, 52 (01) :28-42
[6]  
BAXTER ID, 1998, P INT C SOFTW MAINT
[7]  
BELLON S, 2002, THESIS U STUTTGART G
[8]  
BELLON S, 2007, DETECTION SOFTWARE C
[9]   On the use of clone detection for identifying crosscutting concern code [J].
Bruntink, M ;
van Deursen, A ;
van Engelen, R ;
Tourwé, T .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2005, 31 (10) :804-818
[10]  
Cordy J R., 2004, Proc. LDTA 2004, ACM 4th International Workshop on Language Descriptions, P1