Gapped Code Clone Detection with Lightweight Source Code Analysis

被引:0
作者
Murakami, Hiroaki [1 ]
Hotta, Keisuke [1 ]
Higo, Yoshiki [1 ]
Igaki, Hiroshi [1 ]
Kusumoto, Shinji [1 ]
机构
[1] Osaka Univ, Grad Sch Informat Sci & Technol, Suita, Osaka 5650871, Japan
来源
2013 IEEE 21ST INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC) | 2013年
关键词
Code Clone; Program Analysis; Software Maintenance; Tool Comparison; SYSTEM;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A variety of methods detecting code clones has been proposed before. In order to detect gapped code clones, AST-based technique, PDG-based technique, metric-based technique and text-based technique using the LCS algorithm have been proposed. However, each of those techniques has limitations. For example, existing AST-based techniques and PDG-based techniques require costs for transforming source files into intermediate representations such as ASTs or PDGs and comparing them. Existing metric-based techniques and text-based techniques using the LCS algorithm cannot detect code clones if methods or blocks are partially duplicated. This paper proposes a new method that detects gapped code clones using the Smith-Waterman algorithm to resolve those limitations. The Smith-Waterman algorithm is an algorithm for identifying similar alignments between two sequences even if they include some gaps. The authors developed the proposed method as a software tool named CDSW, and confirmed that the proposed method could resolve the limitations by conducting a quantitative evaluation with Bellon's benchmark.
引用
收藏
页码:93 / 102
页数:10
相关论文
共 50 条
  • [21] Code Clone Detection with Hierarchical Attentive Graph Embedding
    Ji, Xiujuan
    Liu, Lei
    Zhu, Jingwen
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2021, 31 (06) : 837 - 861
  • [22] LLVM-Based Code Clone Detection Framework
    Avetisyan, Arutyun
    Kurmangaleev, Shamil
    Sargsyan, Sevak
    Arutunian, Mariam
    Belevantsev, Andrey
    TENTH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGIES REVISED SELECTED PAPERS CSIT-2015, 2015, : 100 - 104
  • [23] A systematic literature review on source code similarity measurement and clone detection: Techniques, applications, and challenges
    Zakeri-Nasrabadi, Morteza
    Parsa, Saeed
    Ramezani, Mohammad
    Roy, Chanchal
    Ekhtiarzadeh, Masoud
    JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 204
  • [24] CONCORD: Clone-Aware Contrastive Learning for Source Code
    Ding, Yangruibo
    Chakraborty, Saikat
    Buratti, Luca
    Pujar, Saurabh
    Morari, Alessandro
    Kaiser, Gail
    Ray, Baishakhi
    PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023, : 26 - 38
  • [25] Clone Swarm: A Cloud Based Code-Clone Analysis Tool
    Bandi, Venkat
    Roy, Chanchal K.
    Gutwin, Carl
    PROCEEDINGS OF THE 2020 IEEE 14TH INTERNATIONAL WORKSHOP ON SOFTWARE CLONES (IWSC '20), 2020, : 52 - 56
  • [26] AndroClonium: Bytecode-Level Code Clone Detection for Obfuscated Android Apps
    Foroughipour, Ardalan
    Stakhanova, Natalia
    Abazari, Farzaneh
    Sistany, Bahman
    ICT SYSTEMS SECURITY AND PRIVACY PROTECTION (SEC 2022), 2022, 648 : 379 - 397
  • [27] SSA-HIAST: A Novel Framework for Code Clone Detection
    Saini, Neha
    Singh, Sukhdip
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (02): : 2999 - 3017
  • [28] Fast Code Clone Detection Based on Weighted Recursive Autoencoders
    Zeng, Jie
    Ben, Kerong
    Li, Xiaowei
    Zhang, Xian
    IEEE ACCESS, 2019, 7 : 125062 - 125078
  • [29] Case Study on Semantic Clone Detection Based On Code Behavior
    Priyambadha, Bayu
    Rochimah, Siti
    2014 International Conference on Data and Software Engineering (ICODSE), 2014,
  • [30] Open-Source Tools and Benchmarks for Code-Clone Detection: Past, Present, and Future Trends
    Walker, Andrew
    Cerny, Tomas
    Song, Eungee
    APPLIED COMPUTING REVIEW, 2019, 19 (04): : 28 - 39