Plagiarism detection based on semantic analysis

被引:4
作者
Mukherjee, Indrajit [1 ]
Kumar, Bipul [1 ]
Singh, Samarth [1 ]
Sharma, Kishan [1 ]
机构
[1] BIT Mesra, Dept Comp Sci & Engn, Ranchi 835215, Bihar, India
关键词
semantic similarity; plagiarism detection; documents; WordNet;
D O I
10.1504/IJKL.2018.092316
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Plagiarism means copy and paste for a text or change in some words or make use of synonymous or near synonymous words without citing the source. Plagiarism is on rise especially in the academic and research field due the availability of the digital text documents in the internet which can easily be copied and pasted. Existing approaches for detecting the plagiarism have either ignored or made limited use of information about semantic similarities between the words. We proposed a method to measure the semantic similarity between the documents by mapping keywords (verbs; adverbs; adjectives; descriptors; etc.) with the nouns and then finding the similarity between the mapped words that can rectify the existing shortcomings. The efficiency of the algorithm is evaluated on the dataset (corpus of Plagiarised Short Answers) (Clough and Stevenson, 2011). The experiments showed that the proposed algorithm gives significantly accurate results in detecting semantic based similarity between the documents and found to outperform previously published methods.
引用
收藏
页码:242 / 254
页数:13
相关论文
共 50 条
  • [41] Intrinsic plagiarism detection
    Eissen, Sven Meyer zu
    Stein, Benno
    ADVANCES IN INFORMATION RETRIEVAL, 2006, 3936 : 565 - 569
  • [42] Design and Development of a Plagiarism Corpus in Thai for Plagiarism Detection
    Thaiprayoon, Santipong
    Palingoon, Pornpimon
    Trakultaweekoon, Kanokorn
    PROCEEDINGS OF 2019 11TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2019), 2019, : 376 - 380
  • [43] Unmasking text plagiarism using syntactic-semantic based natural language processing techniques: Comparisons, analysis and challenges
    Vani, K.
    Gupta, Deepa
    INFORMATION PROCESSING & MANAGEMENT, 2018, 54 (03) : 408 - 432
  • [44] Academic Plagiarism Detection: A Systematic Literature Review
    Foltynek, Tomas
    Meuschke, Norman
    Gipp, Bela
    ACM COMPUTING SURVEYS, 2020, 52 (06)
  • [45] Texts Semantic Similarity Detection Based Graph Approach
    Mohebbi, Majid
    Talebpour, Alireza
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2016, 13 (02) : 246 - 251
  • [46] A New Online Plagiarism Detection System based on Deep Learning
    Hambi, El Mostafa
    Benabbou, Faouzia
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (09) : 470 - 478
  • [47] On Automatic Plagiarism Detection Based on n-Grams Comparison
    Barron-Cedeno, Alberto
    Rosso, Paolo
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 696 - 700
  • [48] A Ranking-Based Text Matching Approach for Plagiarism Detection
    Kong, Leilei
    Han, Zhongyuan
    Qi, Haoliang
    Lu, Zhimao
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2018, E101A (05) : 799 - 810
  • [49] Vector Representation of Words for Plagiarism Detection Based on String Matching
    Baba, Kensuke
    Nakatoh, Tetsuya
    Minami, Toshiro
    HUMAN INTERFACE AND THE MANAGEMENT OF INFORMATION: SUPPORTING LEARNING, DECISION-MAKING AND COLLABORATION, HCI INTERNATIONAL 2017, PT II, 2017, 10274 : 341 - 350
  • [50] Plagiarism Detection Using Feature-Based Neural Networks
    Engels, Steve
    Lakshmanan, Vivek
    Craig, Michelle
    SIGCSE 2007: PROCEEDINGS OF THE THIRTY-EIGHTH SIGCSE TECHNICAL SYMPOSIUM ON COMPUTER SCIENCE EDUCATION, 2007, : 34 - 38