An improved plagiarism detection scheme based on semantic role labeling

Cited by: 49
Authors
Osman, Ahmed Hamza [1, 2]
Salim, Naomie [1]
Binwahlan, Mohammed Salem [3]
Alteeb, Rihab [4]
Abuobieda, Albaraa [1, 2]
Affiliations
[1] Univ Teknol Malaysia, Fac Comp Sci & Informat Syst, Skudai, Johor, Malaysia
[2] Int Univ Africa, Fac Comp Studies, Khartoum, Sudan
[3] Hadhramout Univ Sci & Technol, Fac Sci Appl, Seiyun, Hadhramout, Yemen
[4] Sudan Univ Sci & Technol, Fac Comp Sci, Khartoum, Sudan
Keywords
Plagiarism detection; Semantic similarity; Semantic role; Arguments weight
DOI
10.1016/j.asoc.2011.12.021
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Plagiarism occurs when content is copied without permission or citation. One contributing factor is that many text documents on the internet are easy to access and copy. This paper introduces a plagiarism detection technique based on Semantic Role Labeling (SRL). The technique analyses and compares text based on the semantic role assigned to each term in a sentence. SRL is superior at generating semantic arguments for each sentence. The paper also introduces a weighting of each argument generated by SRL in order to study its behaviour. It was found that not all arguments affect the plagiarism detection process. In addition, experimental results on the PAN-PC-09 data sets showed that the proposed method significantly outperforms the modern methods for plagiarism detection in terms of Recall, Precision and F-measure. © 2012 Elsevier B.V. All rights reserved.
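The idea of comparing sentences argument by argument, with a weight per argument, can be illustrated with a minimal sketch. This is not the paper's formula: the abstract does not give one, so the per-role Jaccard overlap, the weighted mean, the PropBank-style role labels (A0, V, A1, AM-TMP), the weight values, and the names `jaccard` and `argument_similarity` are all assumptions made for illustration. The sentences are assumed to be already labeled by some SRL tool.

```python
from typing import Dict, Set

# Hypothetical SRL output: role label -> set of terms filling that role.
Labeled = Dict[str, Set[str]]


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard overlap of two term sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def argument_similarity(s1: Labeled, s2: Labeled,
                        weights: Dict[str, float]) -> float:
    """Weighted mean of per-role Jaccard overlaps.

    Roles missing from `weights` get weight 0.0, so they are ignored.
    """
    roles = set(s1) | set(s2)
    total_weight = sum(weights.get(r, 0.0) for r in roles)
    if total_weight == 0.0:
        return 0.0
    score = sum(
        weights.get(r, 0.0) * jaccard(s1.get(r, set()), s2.get(r, set()))
        for r in roles
    )
    return score / total_weight


# Illustrative, hand-labeled sentences (assumed data, not from PAN-PC-09).
suspicious = {"A0": {"john"}, "V": {"wrote"}, "A1": {"report"},
              "AM-TMP": {"yesterday"}}
source = {"A0": {"john"}, "V": {"wrote"}, "A1": {"report"},
          "AM-TMP": {"monday"}}

# Assumed weights: here the temporal argument is given no influence.
selective = {"A0": 1.0, "V": 1.0, "A1": 1.0, "AM-TMP": 0.0}
uniform = {"A0": 1.0, "V": 1.0, "A1": 1.0, "AM-TMP": 1.0}

print(argument_similarity(suspicious, source, selective))
print(argument_similarity(suspicious, source, uniform))
```

Setting a role's weight to zero removes it from the comparison, which is one simple way to express the abstract's observation that not all arguments affect detection. Which arguments matter in practice would have to come from the paper's experiments, not from this sketch.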
Pages: 1493-1502
Page count: 10
Related papers
50 in total
  • [21] An Improved DE Algorithm to Optimise the Learning Process of a BERT-based Plagiarism Detection Model
    Moravvej, Seyed Vahid
    Mousavirad, Seyed Jalaleddin
    Oliva, Diego
    Schaefer, Gerald
    Sobhaninia, Zahra
    2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2022,
  • [22] Semantic role labeling for Arabic language using case-based reasoning approach
    Meguehout H.
    Bouhadada T.
    Laskri M.T.
    Meguehout, Hamza (meguehout.hamza@gmail.com), 1600, Springer Science and Business Media, LLC (20): : 363 - 372
  • [23] Event Lexical Database: A Semantic Role Labeling Approach
    Siaw, Nyuk Hiong
    Kulathuramaiyer, Narayanan
    Ranaivo-Malancon, Bali
    Labadin, Jane
    NEW TRENDS IN SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES, 2014, 265 : 884 - 898
  • [24] Uncovering highly obfuscated plagiarism cases using fuzzy semantic-based similarity model
    Alzahrani, Salha M.
    Salim, Naomie
    Palade, Vasile
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2015, 27 (03) : 248 - 268
  • [25] AST-Based Plagiarism Detection Method
    Zhang, Liping
    Liu, Dongsheng
    Li, Yanchen
    Zhong, Mei
    INTERNET OF THINGS-BK, 2012, 312 : 611 - 618
  • [26] Plagiarism Detection in Homework Based on Image Hashing
    Chen, Ying
    Gan, Liping
    Zhang, Shiqing
    Guo, Wenping
    Chuang, Yuelong
    Zhao, Xiaoming
    DATA SCIENCE, PT II, 2017, 728 : 424 - 432
  • [27] A Coding Style-based Plagiarism Detection
    Arabyarmohamady, S.
    Moradi, H.
    Asadpour, M.
    2012 INTERNATIONAL CONFERENCE ON INTERACTIVE MOBILE AND COMPUTER AIDED LEARNING (IMCL), 2012, : 180 - 186
  • [28] Token-based Plagiarism Detection for Metamodels
    Saglam, Timur
    Hahner, Sebastian
    Wittler, Jan Willem
    Kuehn, Thomas
    ACM/IEEE 25TH INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS, MODELS 2022 COMPANION, 2022, : 138 - 141
  • [29] An AST-Based Code Plagiarism Detection Algorithm
    Zhao, Jingling
    Xia, Kunfeng
    Fu, Yilun
    Cui, Baojiang
    2015 10TH INTERNATIONAL CONFERENCE ON BROADBAND AND WIRELESS COMPUTING, COMMUNICATION AND APPLICATIONS (BWCCA 2015), 2015, : 178 - 182
  • [30] A Plagiarism Detection Method Based on Learning Behavior Analysis
    Tang, Wen-jun
    Zou, Du
    Zhang, Ling
    4TH INTERNATIONAL CONFERENCE ON EDUCATION REFORM AND MODERN MANAGEMENT (ERMM 2017), 2017, : 43 - 47