Code Similarities Beyond Copy & Paste

被引:18
作者
Juergens, Elmar [1 ]
Deissenboeck, Florian [1 ]
Hummel, Benjamin [1 ]
机构
[1] Tech Univ Munich, Inst Informat, Munich, Germany
来源
14TH EUROPEAN CONFERENCE ON SOFTWARE MAINTENANCE AND REENGINEERING (CSMR 2010) | 2010年
关键词
D O I
10.1109/CSMR.2010.33
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Redundant source code hinders software maintenance, since updates have to be performed in multiple places. This holds independent of whether redundancy was created by copy&paste or by independent development of behaviorally similar code. Existing clone detection tools successfully discover syntactically similar redundant code. They thus work well for redundancy that has been created by copy&paste. But: how syntactically similar is behaviorally similar code of independent origin? This paper presents the results of a controlled experiment that demonstrates that behaviorally similar code of independent origin is highly unlikely to be syntactically similar. In fact, it is so syntactically different, that existing clone detection approaches cannot identify more than 1% of such redundancy. This is unfortunate, as manual inspections of open source software indicate that behaviorally similar code of independent origin does exist in practice and does present problems to maintenance.
引用
收藏
页码:78 / 87
页数:10
相关论文
共 26 条
[1]  
Al-Ekram R., 2005, P ESEM 05
[2]  
Alias C., 2003, WCRE 03
[3]  
[Anonymous], 2007, 541 QUEENS U KINGST
[4]  
Baxter I.D., 1998, ICSM '98
[5]  
Clarke EM, 1999, MODEL CHECKING, P1
[6]   Tool support for continuous quality control [J].
Deissenboeck, Florian ;
Juergens, Elmer ;
Hummel, Benjamin ;
Wagner, Stefan ;
Mas y Parareda, Bonedikt ;
Pizka, Markus .
IEEE SOFTWARE, 2008, 25 (05) :60-67
[7]  
Eckhardt D. E., 1991, IEEE TSE, V17
[8]  
Gabel M., 2008, ICSE 08
[9]  
Ianov I. I., 1958, COMMUN ACM
[10]  
Jiang E. C. Lingxiao, 2007, ESEC FSE 07