Gapped Code Clone Detection with Lightweight Source Code Analysis

被引:0
作者
Murakami, Hiroaki [1 ]
Hotta, Keisuke [1 ]
Higo, Yoshiki [1 ]
Igaki, Hiroshi [1 ]
Kusumoto, Shinji [1 ]
机构
[1] Osaka Univ, Grad Sch Informat Sci & Technol, Suita, Osaka 5650871, Japan
来源
2013 IEEE 21ST INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC) | 2013年
关键词
Code Clone; Program Analysis; Software Maintenance; Tool Comparison; SYSTEM;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A variety of methods detecting code clones has been proposed before. In order to detect gapped code clones, AST-based technique, PDG-based technique, metric-based technique and text-based technique using the LCS algorithm have been proposed. However, each of those techniques has limitations. For example, existing AST-based techniques and PDG-based techniques require costs for transforming source files into intermediate representations such as ASTs or PDGs and comparing them. Existing metric-based techniques and text-based techniques using the LCS algorithm cannot detect code clones if methods or blocks are partially duplicated. This paper proposes a new method that detects gapped code clones using the Smith-Waterman algorithm to resolve those limitations. The Smith-Waterman algorithm is an algorithm for identifying similar alignments between two sequences even if they include some gaps. The authors developed the proposed method as a software tool named CDSW, and confirmed that the proposed method could resolve the limitations by conducting a quantitative evaluation with Bellon's benchmark.
引用
收藏
页码:93 / 102
页数:10
相关论文
共 50 条
[21]   Gemini: Maintenance support environment based on code clone analysis [J].
Ueda, Y ;
Kamiya, T ;
Kusumoto, S ;
Inoue, K .
EIGHTH IEEE SYMPOSIUM ON SOFTWARE METRICS, PROCEEDINGS, 2002, :67-76
[22]   LLVM-Based Code Clone Detection Framework [J].
Avetisyan, Arutyun ;
Kurmangaleev, Shamil ;
Sargsyan, Sevak ;
Arutunian, Mariam ;
Belevantsev, Andrey .
TENTH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGIES REVISED SELECTED PAPERS CSIT-2015, 2015, :100-104
[23]   Code Clone Detection with Hierarchical Attentive Graph Embedding [J].
Ji, Xiujuan ;
Liu, Lei ;
Zhu, Jingwen .
INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2021, 31 (06) :837-861
[24]   CONCORD: Clone-Aware Contrastive Learning for Source Code [J].
Ding, Yangruibo ;
Chakraborty, Saikat ;
Buratti, Luca ;
Pujar, Saurabh ;
Morari, Alessandro ;
Kaiser, Gail ;
Ray, Baishakhi .
PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023, :26-38
[25]   A systematic literature review on source code similarity measurement and clone detection: Techniques, applications, and challenges [J].
Zakeri-Nasrabadi, Morteza ;
Parsa, Saeed ;
Ramezani, Mohammad ;
Roy, Chanchal ;
Ekhtiarzadeh, Masoud .
JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 204
[26]   Clone Swarm: A Cloud Based Code-Clone Analysis Tool [J].
Bandi, Venkat ;
Roy, Chanchal K. ;
Gutwin, Carl .
PROCEEDINGS OF THE 2020 IEEE 14TH INTERNATIONAL WORKSHOP ON SOFTWARE CLONES (IWSC '20), 2020, :52-56
[27]   AndroClonium: Bytecode-Level Code Clone Detection for Obfuscated Android Apps [J].
Foroughipour, Ardalan ;
Stakhanova, Natalia ;
Abazari, Farzaneh ;
Sistany, Bahman .
ICT SYSTEMS SECURITY AND PRIVACY PROTECTION (SEC 2022), 2022, 648 :379-397
[28]   Fast Code Clone Detection Based on Weighted Recursive Autoencoders [J].
Zeng, Jie ;
Ben, Kerong ;
Li, Xiaowei ;
Zhang, Xian .
IEEE ACCESS, 2019, 7 :125062-125078
[29]   SSA-HIAST: A Novel Framework for Code Clone Detection [J].
Saini, Neha ;
Singh, Sukhdip .
CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (02) :2999-3017
[30]   Case Study on Semantic Clone Detection Based On Code Behavior [J].
Priyambadha, Bayu ;
Rochimah, Siti .
2014 International Conference on Data and Software Engineering (ICODSE), 2014,