Plagiarism detection based on semantic analysis

被引:4
作者
Mukherjee, Indrajit [1 ]
Kumar, Bipul [1 ]
Singh, Samarth [1 ]
Sharma, Kishan [1 ]
机构
[1] BIT Mesra, Dept Comp Sci & Engn, Ranchi 835215, Bihar, India
关键词
semantic similarity; plagiarism detection; documents; WordNet;
D O I
10.1504/IJKL.2018.092316
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Plagiarism means copy and paste for a text or change in some words or make use of synonymous or near synonymous words without citing the source. Plagiarism is on rise especially in the academic and research field due the availability of the digital text documents in the internet which can easily be copied and pasted. Existing approaches for detecting the plagiarism have either ignored or made limited use of information about semantic similarities between the words. We proposed a method to measure the semantic similarity between the documents by mapping keywords (verbs; adverbs; adjectives; descriptors; etc.) with the nouns and then finding the similarity between the mapped words that can rectify the existing shortcomings. The efficiency of the algorithm is evaluated on the dataset (corpus of Plagiarised Short Answers) (Clough and Stevenson, 2011). The experiments showed that the proposed algorithm gives significantly accurate results in detecting semantic based similarity between the documents and found to outperform previously published methods.
引用
收藏
页码:242 / 254
页数:13
相关论文
共 50 条
  • [31] A Practical Algorithm for Plagiarism Detection Based on Search Engine
    Qiu, Zhuangli
    Xu, Dongling
    MECHANICAL, MATERIALS AND MANUFACTURING ENGINEERING, PTS 1-3, 2011, 66-68 : 2287 - 2290
  • [32] A Cross Language Plagiarism Detection Based on Cloud Computing
    Fan, Chih-Tien
    Nguyen Dang Minh
    Muhammad, Husaini
    INTELLIGENT SYSTEMS AND APPLICATIONS (ICS 2014), 2015, 274 : 2090 - 2099
  • [33] Crowdcrawling Approach for Community Based Plagiarism Detection Service
    Butakov, Sergey
    WWW'14 COMPANION: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 1093 - 1096
  • [34] Plagiarism Detection Tool Based on Programming Activity Logs
    Meier, Heidi
    Lepp, Marina
    Kutt, Rene
    2024 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE, EDUCON 2024, 2024,
  • [35] An Adaptive Image-based Plagiarism Detection Approach
    Meuschke, Norman
    Gondek, Christopher
    Seebacher, Daniel
    Breitinger, Corinna
    Keim, Daniel
    Gipp, Bela
    JCDL'18: PROCEEDINGS OF THE 18TH ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES, 2018, : 131 - 140
  • [36] Plagiarism detection in Chinese based on chunk and paragraph weight
    Wang, Tao
    Fan, Xiao-Zhong
    Liu, Jie
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 2574 - 2579
  • [37] Figure Plagiarism Detection based on Textual Features Representation
    Eisa, Taiseer Abdalla Elfadil
    Salim, Naomie
    Alzahrani, Salha
    2017 6TH ICT INTERNATIONAL STUDENT PROJECT CONFERENCE (ICT-ISPC), 2017,
  • [38] The DOMJudge based Online Judge System with Plagiarism Detection
    Minh Tuan Pham
    Tan Bao Nguyen
    2019 IEEE - RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF), 2019, : 213 - 218
  • [39] A Quantum Genetic Algorithm for Building a Semantic Textual Similarity Estimation Framework for Plagiarism Detection Applications
    Darwish, Saad M.
    Mhaimeed, Ibrahim Abdullah
    Elzoghabi, Adel A.
    ENTROPY, 2023, 25 (09)
  • [40] Identification of Plagiarism Using Syntactic and Semantic Filters
    Ram, R. Vijay Sundar
    Stamatatos, Efstathios
    Devi, Sobha Lalitha
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2014, PART II, 2014, 8404 : 495 - 506