共 24 条
A Novel Source Code Clone Detection Method Based on Dual-GCN and IVHFS
被引:1
|作者:
Yang, Haixin
[1
]
Li, Zhen
[1
]
Guo, Xinyu
[1
]
机构:
[1] Hebei Univ, Sch Cyber Secur & Comp, Baoding 071002, Peoples R China
来源:
关键词:
source code clone detection;
software engineering;
graph convolutional network;
interval-valued hesitant fuzzy set;
abstract syntax tree;
control flow graph;
D O I:
10.3390/electronics12061315
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Source code clone detection, which can identify code fragments with similar functions, plays a significant role in software development and quality assurance. Existing methods either extract single syntactic or semantic information, or ignore the associated information between code statements in different structures. It is difficult for these methods to effectively detect clone pairs with similar functions. In this paper, we propose a new model based on a dual graph convolutional network (GCN) and interval-valued hesitant fuzzy set (IVHFS), which we named DG-IVHFS. Specifically, we simplified and grouped the abstract syntax tree (AST) of source code to obtain the group representations. The group representations of the AST, as well as the control flow graph (CFG) representations, were transformed into graph structures, and then we applied GCNs on them to learn dependencies between nodes. In addition, we introduced IVHFS into the model for a more comprehensive evaluation of similarity. Our experimental results demonstrated that the precision, recall, and F1-scores of DG-IVHFS on the BigCloneBench and GoogleCodeJam datasets reached 98, 97 and 97% and 98, 93 and 95%, respectively, exceeding current state-of-the-art models. Moreover, our model performed well in terms of time consumption.
引用
收藏
页数:18
相关论文