Plagiarism Detection System for Indonesia Text Based Document by Fingerprint Method and Natural Language Processing Approach

被引:0
作者
Winarti, Titin [1 ]
Kerami, Djati [2 ]
Etp, Lussiana [3 ]
Sekarwati, Kemal Ade [4 ]
机构
[1] Semarang Univ, Fac Informat Technol & Commun, Semarang 50196, Indonesia
[2] Indonesia Univ, Fac Math & Nat Sci, Depok 16424, Indonesia
[3] Sch Informat Management & Comp Jakarta, Comp Syst, Jakarta 12140, Indonesia
[4] Gunadarma Univ, Fac Comp Sci & Informat Technol, Jakarta 16424, Indonesia
关键词
Plagiarism; Fingerprint; Natural Language Processing;
D O I
10.1166/asl.2016.7993
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The practice of plagiarism is very often carried out in a community environment for example in academia. So it can be stated that plagiarism is a major concern, especially in the academic environment, where it can affect both the credibility of the institution and its ability to ensure the quality of its students. In other words, the act of plagiarism may result in a decrease of creativity in the community. This research uses a combination of fingerprint method with natural language processing (NLP) approach. With the process or plagiarism detection system can be done through various methods, such as by the method of calculation algorithms Manber the similarities using the Jaccard coefficient and K-gram method as an alternative in the detection of document similarity, is expected to allow a user to use the application this without deciding the value of gram and its window to produce an accurate similarity value. Although it has been proven NLP techniques can improve the accuracy of detection tasks, there are other challenges remain. Current plagiarism detection tools are mostly limited to comparisons of suspicious plagiarised texts and potential original texts at string level. By doing stemming, the document similarity measurement process there was an increase of 31% measurement document based on documents that were tested.
引用
收藏
页码:3128 / 3131
页数:4
相关论文
共 50 条
  • [31] Identifying Mentions of Pain in Mental Health Records Text: A Natural Language Processing Approach
    Chaturvedi, Jaya
    Velupillai, Sumithra
    Stewart, Robert
    Roberts, Angus
    MEDINFO 2023 - THE FUTURE IS ACCESSIBLE, 2024, 310 : 695 - 699
  • [32] Towards an evolutionary-based approach for natural language processing
    Manzoni, Luca
    Jakobovic, Domagoj
    Mariot, Luca
    Picek, Stjepan
    Castelli, Mauro
    GECCO'20: PROCEEDINGS OF THE 2020 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2020, : 985 - 993
  • [33] Natural Language Processing based Anomalous System Call Sequences Detection with Virtual Memory Introspection
    Peddoju, Suresh K.
    Upadhyay, Himanshu
    Soni, Jayesh
    Prabakar, Nagarajan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 455 - 460
  • [34] Natural language processing based anomalous system call sequences detection with virtual memory introspection
    Peddoju S.K.
    Upadhyay H.
    Soni J.
    Prabakar N.
    International Journal of Advanced Computer Science and Applications, 2020, 11 (05): : 455 - 460
  • [35] A systematic review of applications of natural language processing and future challenges with special emphasis in text-based emotion detection
    Kusal, Sheetal
    Patil, Shruti
    Choudrie, Jyoti
    Kotecha, Ketan
    Vora, Deepali
    Pappas, Ilias
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (12) : 15129 - 15215
  • [36] An AV control method by dialog based on natural language processing
    Matsuda, M
    Nonaka, T
    Hase, T
    PROCEEDINGS OF THE NINTH INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS 2005, 2005, : 31 - 33
  • [37] A systematic review of applications of natural language processing and future challenges with special emphasis in text-based emotion detection
    Sheetal Kusal
    Shruti Patil
    Jyoti Choudrie
    Ketan Kotecha
    Deepali Vora
    Ilias Pappas
    Artificial Intelligence Review, 2023, 56 : 15129 - 15215
  • [38] Drug-drug interaction extraction-based system: An natural language processing approach
    Machado, Jose
    Rodrigues, Carla
    Sousa, Regina
    Gomes, Luis Mendes
    EXPERT SYSTEMS, 2025, 42 (01)
  • [39] Research on web monitoring system based on natural language processing
    Liu, L
    Fan, XZ
    Zhao, XP
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 746 - 751
  • [40] A Hybrid Intelligent Text Watermarking and Natural Language Processing Approach for Transferring and Receiving an Authentic English Text Via Internet
    Hilal, Anwer Mustafa
    Al-Wesabi, Fahd N.
    Abdelmaboud, Abdelzahir
    Hamza, Manar Ahmed
    Mahzari, Mohammad
    Hassan, Abdulkhaleq Q. A.
    COMPUTER JOURNAL, 2022, 65 (02) : 423 - 435