An improved plagiarism detection scheme based on semantic role labeling

Cited by: 49
Authors
Osman, Ahmed Hamza [1, 2]
Salim, Naomie [1]
Binwahlan, Mohammed Salem [3]
Alteeb, Rihab [4]
Abuobieda, Albaraa [1, 2]
Affiliations
[1] Univ Teknol Malaysia, Fac Comp Sci & Informat Syst, Skudai, Johor, Malaysia
[2] Int Univ Africa, Fac Comp Studies, Khartoum, Sudan
[3] Hadhramout Univ Sci & Technol, Fac Sci Appl, Seiyun, Hadhramout, Yemen
[4] Sudan Univ Sci & Technol, Fac Comp Sci, Khartoum, Sudan
Keywords
Plagiarism detection; Semantic similarity; Semantic role; Arguments weight
DOI
10.1016/j.asoc.2011.12.021
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Plagiarism occurs when content is copied without permission or citation. One contributing factor is that many text documents on the internet are easy to access and copy. This paper introduces a plagiarism detection technique based on Semantic Role Labeling (SRL). The technique analyses and compares text based on the semantic role assigned to each term in a sentence. SRL excels at generating the semantic arguments of each sentence. The paper also introduces a weighting of each argument generated by SRL in order to study its behaviour. It was found that not all arguments affect the plagiarism detection process. In addition, experimental results on the PAN-PC-09 data set showed that our method significantly outperforms modern plagiarism detection methods in terms of Recall, Precision and F-measure. © 2012 Elsevier B.V. All rights reserved.
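
The idea of comparing sentences argument by argument, with a weight per argument, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the similarity measure or the weights, so the Jaccard overlap, the weight values, the PropBank-style role labels (A0, A1, V, AM-TMP, AM-LOC), and the hand-labeled sentences below are all assumptions chosen for illustration. A real system would obtain the role assignments from an SRL tool.

```python
from typing import Dict, Set

# Hypothetical SRL output: role label -> set of terms filling that role.
Roles = Dict[str, Set[str]]


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard overlap of two term sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def weighted_role_similarity(
    suspect: Roles, source: Roles, weights: Dict[str, float]
) -> float:
    """Weighted average of per-role Jaccard overlaps.

    Roles missing from `weights` get weight 0.0 (an assumption), so they
    do not influence the score.
    """
    roles = set(suspect) | set(source)
    total = sum(weights.get(r, 0.0) for r in roles)
    if total == 0:
        return 0.0
    score = sum(
        weights.get(r, 0.0) * jaccard(suspect.get(r, set()), source.get(r, set()))
        for r in roles
    )
    return score / total


# Hand-labeled example sentences (illustrative, not from the paper).
suspect = {"A0": {"student"}, "V": {"copy"}, "A1": {"essay"},
           "AM-TMP": {"yesterday"}}
source = {"A0": {"student"}, "V": {"copy"}, "A1": {"essay", "internet"},
          "AM-LOC": {"library"}}

# Assumed weights: core arguments count, modifiers are ignored.
weights = {"A0": 1.0, "V": 1.0, "A1": 1.0, "AM-TMP": 0.0, "AM-LOC": 0.0}

score = weighted_role_similarity(suspect, source, weights)
print(round(score, 3))  # 0.833
```

Setting a role's weight to zero removes it from the comparison, which mirrors the abstract's observation that not all arguments affect the detection process.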
Pages: 1493-1502
Number of pages: 10