Plagiarism Detection System for Indonesia Text Based Document by Fingerprint Method and Natural Language Processing Approach

Cited by: 0
Authors
Winarti, Titin [1]
Kerami, Djati [2]
Etp, Lussiana [3]
Sekarwati, Kemal Ade [4]
Affiliations
[1] Semarang Univ, Fac Informat Technol & Commun, Semarang 50196, Indonesia
[2] Indonesia Univ, Fac Math & Nat Sci, Depok 16424, Indonesia
[3] Sch Informat Management & Comp Jakarta, Comp Syst, Jakarta 12140, Indonesia
[4] Gunadarma Univ, Fac Comp Sci & Informat Technol, Jakarta 16424, Indonesia
Keywords
Plagiarism; Fingerprint; Natural Language Processing;
DOI
10.1166/asl.2016.7993
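Chinese Library Classification number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code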
07 ; 0710 ; 09 ;
Abstract
Plagiarism is common in many communities, academia among them. It is a major concern in the academic environment in particular, where it can affect both the credibility of an institution and its ability to ensure the quality of its students. Plagiarism may also reduce creativity in the community. This research combines the fingerprint method with a natural language processing (NLP) approach. Plagiarism detection can be carried out by various methods, such as Manber's algorithm with similarity computed by the Jaccard coefficient, and the k-gram method as an alternative for detecting document similarity. The system is expected to let a user run the application without choosing the gram size and window size, and still obtain an accurate similarity value. Although NLP techniques have been shown to improve the accuracy of detection tasks, other challenges remain. Current plagiarism detection tools are mostly limited to string-level comparisons between suspected plagiarised texts and potential original texts. Applying stemming increased the measured document similarity by 31% on the documents that were tested.
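The fingerprint pipeline named in the abstract (k-grams, a Manber-style selection of hashes, and the Jaccard coefficient) can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the function names, the CRC32 hash, and the default values `k=5` and `p=4` are all hypothetical choices, and the stemming step the abstract reports is left out because the abstract does not say which stemmer was used.

```python
import re
import zlib


def normalize(text):
    """Lowercase the text and drop everything except ASCII letters and digits."""
    return re.sub(r"[^a-z0-9]", "", text.lower())


def kgrams(text, k):
    """Return every contiguous substring of length k (empty if text is shorter)."""
    return [text[i:i + k] for i in range(len(text) - k + 1)]


def fingerprint(text, k=5, p=4):
    """Manber-style selection: hash each k-gram, keep hashes that are 0 mod p.

    CRC32 is an assumed stand-in for the hash function; it is used here
    because it is deterministic across runs, unlike Python's built-in hash().
    """
    hashes = (zlib.crc32(g.encode("utf-8")) for g in kgrams(normalize(text), k))
    return {h for h in hashes if h % p == 0}


def jaccard(a, b):
    """Jaccard coefficient |A ∩ B| / |A ∪ B|, defined as 0.0 for two empty sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def similarity(doc_a, doc_b, k=5, p=4):
    """Similarity of two documents as the Jaccard coefficient of their fingerprints."""
    return jaccard(fingerprint(doc_a, k, p), fingerprint(doc_b, k, p))
```

Calling `similarity(doc_a, doc_b)` returns a value between 0 and 1. Setting `p=1` keeps every k-gram hash, which makes the comparison exhaustive; larger values of `p` keep roughly one hash in `p` and trade accuracy for a smaller fingerprint.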
Pages: 3128 - 3131
Number of pages: 4