An Improved DE Algorithm to Optimise the Learning Process of a BERT-based Plagiarism Detection Model

被引：37

作者：

Moravvej, Seyed Vahid ^{[1
]}

Mousavirad, Seyed Jalaleddin ^{[2
]}

Oliva, Diego ^{[3
]}

Schaefer, Gerald ^{[4
]}

Sobhaninia, Zahra ^{[1
]}

机构：

[1] Isfahan Univ Technol, Dept Comp Engn, Esfahan, Iran

[2] Hakim Sabzevari Univ, Comp Engn Dept, Sabzevar, Iran

[3] Univ Guadalajara, Dept Innovac Basada Informac & Conocimiento, Guadalajara, Jalisco, Mexico

[4] Loughborough Univ, Dept Comp Sci, Loughborough, Leics, England

来源：

2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC) | 2022年

关键词：

Plagiarism detection; BERT; LSTM; attention mechanism; differential evolution;

D O I：

10.1109/CEC55065.2022.9870280

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Plagiarism detection is a challenging task, aiming to identify similar items in two documents. In this paper, we present a novel approach to automatic plagiarism detection that combines BERT (bidirectional encoder representations from transformers) word embedding, attention mechanism-based long short-term memory (LSTM) networks, and an improved differential evolution (DE) algorithm for weight initialisation. BERT is used to pretrain deep bidirectional representations in all layers, while the pre-trained BERT model can be fine-tuned with only one extra output layer without significant changes in architecture. Deep learning algorithms often use the random weighting method for initialisation, followed by gradient-based optimisation algorithms such as back-propagation for training, making them susceptible to getting trapped in local optima. To address this, population-based metaheuristic algorithms such as DE can be used. We propose an improved DE algorithm with a clustering-based mutation operator, where first a winning cluster of candidate solutions is identified and a new updating strategy is then applied to include new candidate solutions in the current population. The proposed DE algorithm is used in LSTM, attention mechanism, and feed-forward neural networks to yield the initial seeds for subsequent gradient-based optimisation. We compare our proposed model with conventional and population-based approaches on three datasets (SNLI, MSRP and SemEva12014) and demonstrate it to give superior plagiarism detection performance.

引用

页数：7

共 47 条

[1] [Anonymous], 2015, CS224d: deep learning for natural language processing stanford
[2] Salp Swarm Algorithm (SSA) for Training Feed-Forward Neural Networks
Bairathi, Divya
Gopalani, Dinesh
[J]. SOFT COMPUTING FOR PROBLEM SOLVING, SOCPROS 2017, VOL 1, 2019, 816 : 521 - 534
[3] Adaptive k-tournament mutation scheme for differential evolution
Bajer, Drazen
[J]. APPLIED SOFT COMPUTING, 2019, 85
[4] Bao W, 2018, INT CONF ASIAN LANG, P312, DOI 10.1109/IALP.2018.8629212
[5] Bekkar M., 2013, J INF ENG APPL, V3, P27, DOI DOI 10.5121/IJDKP.2013.3402
[6] Chen J, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P4946
[7] Chen Q., 2018, AAAI C ARTIFICIAL IN, V32
[8] Chi Z., 2018, Journal of Intelligent Learning Systems and Applications, V10, P121
[9] Emerging Trends Word2Vec
Church, Kenneth Ward
[J]. NATURAL LANGUAGE ENGINEERING, 2017, 23 (01) : 155 - 162
[10] A population-based algorithm-generator for real-parameter optimization
Deb, K
[J]. SOFT COMPUTING, 2005, 9 (04) : 236 - 253

← 1 2 3 4 5 →