An Improved DE Algorithm to Optimise the Learning Process of a BERT-based Plagiarism Detection Model

被引:37
作者
Moravvej, Seyed Vahid [1 ]
Mousavirad, Seyed Jalaleddin [2 ]
Oliva, Diego [3 ]
Schaefer, Gerald [4 ]
Sobhaninia, Zahra [1 ]
机构
[1] Isfahan Univ Technol, Dept Comp Engn, Esfahan, Iran
[2] Hakim Sabzevari Univ, Comp Engn Dept, Sabzevar, Iran
[3] Univ Guadalajara, Dept Innovac Basada Informac & Conocimiento, Guadalajara, Jalisco, Mexico
[4] Loughborough Univ, Dept Comp Sci, Loughborough, Leics, England
来源
2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC) | 2022年
关键词
Plagiarism detection; BERT; LSTM; attention mechanism; differential evolution;
D O I
10.1109/CEC55065.2022.9870280
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Plagiarism detection is a challenging task, aiming to identify similar items in two documents. In this paper, we present a novel approach to automatic plagiarism detection that combines BERT (bidirectional encoder representations from transformers) word embedding, attention mechanism-based long short-term memory (LSTM) networks, and an improved differential evolution (DE) algorithm for weight initialisation. BERT is used to pretrain deep bidirectional representations in all layers, while the pre-trained BERT model can be fine-tuned with only one extra output layer without significant changes in architecture. Deep learning algorithms often use the random weighting method for initialisation, followed by gradient-based optimisation algorithms such as back-propagation for training, making them susceptible to getting trapped in local optima. To address this, population-based metaheuristic algorithms such as DE can be used. We propose an improved DE algorithm with a clustering-based mutation operator, where first a winning cluster of candidate solutions is identified and a new updating strategy is then applied to include new candidate solutions in the current population. The proposed DE algorithm is used in LSTM, attention mechanism, and feed-forward neural networks to yield the initial seeds for subsequent gradient-based optimisation. We compare our proposed model with conventional and population-based approaches on three datasets (SNLI, MSRP and SemEva12014) and demonstrate it to give superior plagiarism detection performance.
引用
收藏
页数:7
相关论文
共 47 条
  • [1] [Anonymous], 2015, CS224d: deep learning for natural language processing stanford
  • [2] Salp Swarm Algorithm (SSA) for Training Feed-Forward Neural Networks
    Bairathi, Divya
    Gopalani, Dinesh
    [J]. SOFT COMPUTING FOR PROBLEM SOLVING, SOCPROS 2017, VOL 1, 2019, 816 : 521 - 534
  • [4] Bao W, 2018, INT CONF ASIAN LANG, P312, DOI 10.1109/IALP.2018.8629212
  • [5] Bekkar M., 2013, J INF ENG APPL, V3, P27, DOI DOI 10.5121/IJDKP.2013.3402
  • [6] Chen J, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P4946
  • [7] Chen Q., 2018, AAAI C ARTIFICIAL IN, V32
  • [8] Chi Z., 2018, Journal of Intelligent Learning Systems and Applications, V10, P121
  • [9] Emerging Trends Word2Vec
    Church, Kenneth Ward
    [J]. NATURAL LANGUAGE ENGINEERING, 2017, 23 (01) : 155 - 162
  • [10] A population-based algorithm-generator for real-parameter optimization
    Deb, K
    [J]. SOFT COMPUTING, 2005, 9 (04) : 236 - 253