Deep learning application on code clone detection: A review of current knowledge

被引:30
作者
Lei, Maggie [1 ]
Li, Hao [1 ]
Li, Ji [1 ]
Aundhkar, Namrata [1 ]
Kim, Dae-Kyoo [1 ]
机构
[1] Oakland Univ, Comp Sci & Engn, Rochester, MI 48309 USA
关键词
Code clone; Code smell; Deep learning; Duplicate code; Machine learning; Literature review;
D O I
10.1016/j.jss.2021.111141
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Bad smells in code are indications of low code quality representing potential threats to the maintainability and reusability of software. Code clone is a type of bad smells caused by code fragments that have the same functional semantics with syntactic variations. In the recent years, the research on duplicate code has been dramatically geared up by deep learning techniques powered by advances in computing power. However, there exists little work studying the current state-of-art and future prospects in the area of applying deep learning to code clone detection. In this paper, we present a systematic review of the literature on the application of deep learning on code clone detection. We aim to find and study the most recent work on the subject, discuss their limitations and challenges, and provide insights on the future work. (C) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页数:15
相关论文
共 72 条
[1]  
[Anonymous], 2014, INT C MACH LEARN
[2]   Machine learning techniques for code smell detection: A systematic literature review and meta-analysis [J].
Azeem, Muhammad Ilyas ;
Palomba, Fabio ;
Shi, Lin ;
Wang, Qing .
INFORMATION AND SOFTWARE TECHNOLOGY, 2019, 108 :115-138
[3]  
Balas V.E., 2019, Handbook of Deep Learning Applications, VVolume 136
[4]  
Basit H. A., 2005, P EUR SOFTW ENG C AC, V30, P156, DOI DOI 10.1145/1095430.1081733
[5]  
Bishop C.M., 1995, Neural Networks for Pattern Recognition (Advanced Texts inEconometrics(Paperback)): Bishop, DOI DOI 10.1201/9781420050646.PTB6
[6]  
Bouaziz M, 2016, IEEE W SP LANG TECH, P218, DOI 10.1109/SLT.2016.7846268
[7]   Capturing Source Code Semantics via Tree-based Convolution over API-enhanced AST [J].
Chen, Long ;
Ye, Wei ;
Zhang, Shikun .
CF '19 - PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS, 2019, :174-182
[8]   CLCMiner: Detecting Cross-Language Clones without Intermediates [J].
Cheng, Xiao ;
Peng, Zhiming ;
Jiang, Lingxiao ;
Zhong, Hao ;
Yu, Haibo ;
Zhao, Jianjun .
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (02) :273-284
[9]  
Dang S., 2015, INT J SCI RES, V4, P1903
[10]   A Systematic Literature Review on Bad Smells-5 W's: Which, When, What, Who, Where [J].
de Paulo Sobrinho, Elder Vicente ;
De Lucia, Andrea ;
Maia, Marcelo de Almeida .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2021, 47 (01) :17-66