A New Online Plagiarism Detection System based on Deep Learning

Cited by: 0
Authors
Hambi, El Mostafa [1]
Benabbou, Faouzia [1]
Affiliations
[1] Univ Hassan 2, Fac Sci Ben Msik, Informat Technol & Modeling Lab, Casablanca, Morocco
Keywords
Plagiarism detection; plagiarism detection tools; deep learning; Doc2vec; Stacked Long Short-Term Memory (SLSTM); Convolutional Neural Network (CNN); Siamese neural network; academic integrity
DOI
10.14569/IJACSA.2020.0110956
Chinese Library Classification
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
Plagiarism is a growing and increasingly widespread problem in academia. Plagiarists use a range of techniques, from simple synonym replacement and sentence restructuring to more complex methods that combine several types of transformation. Manual plagiarism detection is difficult, inaccurate, and time-consuming. In this paper, we propose a plagiarism detection framework based on three deep learning models: Doc2vec, Siamese Long Short-Term Memory (SLSTM), and Convolutional Neural Network (CNN). The system has three layers: a preprocessing layer that includes word embedding, learning layers, and a detection layer. To evaluate the system, we surveyed plagiarism detection tools used in academia and compared them on a set of features. Compared to other works, our approach achieves an accuracy of 98.33%, detects different types of plagiarism, lets the user specify another dataset, and supports comparing a document against the results of an internet search.
Pages: 470-478
Number of pages: 9