Plagiarism detection based on semantic analysis

被引:4
|
作者
Mukherjee, Indrajit [1 ]
Kumar, Bipul [1 ]
Singh, Samarth [1 ]
Sharma, Kishan [1 ]
机构
[1] BIT Mesra, Dept Comp Sci & Engn, Ranchi 835215, Bihar, India
关键词
semantic similarity; plagiarism detection; documents; WordNet;
D O I
10.1504/IJKL.2018.092316
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Plagiarism means copy and paste for a text or change in some words or make use of synonymous or near synonymous words without citing the source. Plagiarism is on rise especially in the academic and research field due the availability of the digital text documents in the internet which can easily be copied and pasted. Existing approaches for detecting the plagiarism have either ignored or made limited use of information about semantic similarities between the words. We proposed a method to measure the semantic similarity between the documents by mapping keywords (verbs; adverbs; adjectives; descriptors; etc.) with the nouns and then finding the similarity between the mapped words that can rectify the existing shortcomings. The efficiency of the algorithm is evaluated on the dataset (corpus of Plagiarised Short Answers) (Clough and Stevenson, 2011). The experiments showed that the proposed algorithm gives significantly accurate results in detecting semantic based similarity between the documents and found to outperform previously published methods.
引用
收藏
页码:242 / 254
页数:13
相关论文
共 50 条
  • [11] USING CONCEPTS OF TEXT BASED PLAGIARISM DETECTION IN SOURCE CODE PLAGIARISM ANALYSIS
    Duracik, Michal
    Krsak, Emil
    Hrkut, Patrik
    PLAGIARISM ACROSS EUROPE AND BEYOND 2017, 2017, : 177 - 186
  • [12] Role Term-Based Semantic Similarity Technique for Idea Plagiarism Detection
    Osman, Ahmed Hamza
    Aljahdali, Hani Moetque
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (08) : 475 - 484
  • [13] Content-Based Scientific Figure Plagiarism Detection Using Semantic Mapping
    Eisa, Taiseer Abdalla Elfadil
    Salim, Naomie
    Abdelmaboud, Abdelzahir
    EMERGING TRENDS IN INTELLIGENT COMPUTING AND INFORMATICS: DATA SCIENCE, INTELLIGENT INFORMATION SYSTEMS AND SMART COMPUTING, 2020, 1073 : 420 - 427
  • [14] An Improved Semantic Plagiarism Detection Scheme Based on Chi-squared Automatic Interaction Detection
    Osman, Ahmed Hamza
    Salim, Naomie
    2013 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRICAL AND ELECTRONICS ENGINEERING (ICCEEE), 2013, : 640 - 647
  • [15] Semantic Similarity/Relatedness for Cross language plagiarism detection
    Ezzikouri, Hanane
    Oukessou, Mohamed
    Erritali, Mohammed
    2016 13TH INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS, IMAGING AND VISUALIZATION (CGIV), 2016, : 372 - 374
  • [16] A Plagiarism Detection Method Based on Learning Behavior Analysis
    Tang, Wen-jun
    Zou, Du
    Zhang, Ling
    4TH INTERNATIONAL CONFERENCE ON EDUCATION REFORM AND MODERN MANAGEMENT (ERMM 2017), 2017, : 43 - 47
  • [17] An Approach to Source-Code Plagiarism Detection and Investigation Using Latent Semantic Analysis
    Cosma, Georgina
    Joy, Mike
    IEEE TRANSACTIONS ON COMPUTERS, 2012, 61 (03) : 379 - 394
  • [18] An Improved Online Plagiarism Detection Approach for Semantic Analysis using Custom Search Engine
    Sharma, Kamalpreet
    Jindal, Balkrishan
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 764 - 768
  • [19] Plagiarism indication by syntactic-semantic analysis
    Tachaphetpiboon, S.
    Facundes, N.
    Amornraksa, T.
    2007 ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS, 2007, : 237 - 240
  • [20] Using word semantic concepts for plagiarism detection in text documents
    Chang, Chia-Yang
    Lee, Shie-Jue
    Wu, Chih-Hung
    Liu, Chih-Feng
    Liu, Ching-Kuan
    INFORMATION RETRIEVAL JOURNAL, 2021, 24 (4-5): : 298 - 321