A Mixed Fuzzy Similarity Approach to Detect Plagiarism in Persian Texts

被引:0
|
作者
Ahangarbahan, Hamid [1 ]
Montazer, Gholam Ali [1 ]
机构
[1] Tarbiat Modares Univ, Sch Engn, Tehran, Iran
来源
ADVANCES IN COMPUTATIONAL INTELLIGENCE, PT I (IWANN 2015) | 2015年 / 9094卷
关键词
Plagiarism; Similarity metric; Fuzzy sets; Semantic similarity; Lexical similarity;
D O I
10.1007/978-3-319-19258-1_43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A variety of methods and metrics have been offered so far to measure the extent of similarity among various documents and plagiarism detection systems. However, most of them do not take ambiguity inherent in natural language into account. Therefore, in this paper, a new method taking lexical and semantic features and similarity measures into consideration has been proposed. In the first step, after preprocessing and removing stop word, a text was divided into two parts: general and domain-specific knowledge words. Then, the mixed lexical and semantic fuzzy inference system was designed to assess text similarity. The proposed method was evaluated on Persian paper abstracts of International Conference on e-Learning and e-Teaching (ICELET) Corpus and using IT domain knowledge ontology. The results indicated that the proposed method can achieve a rate of 79% in terms of precision and can detect 83% of the plagiarism cases.
引用
收藏
页码:525 / 534
页数:10
相关论文
共 21 条
  • [1] Fuzzy Semantic-Based String Similarity Experiments to Detect Plagiarism in Indonesian Documents
    Umareta, Chonan Firda Odayakana
    Mariyah, Siti
    2019 3RD INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS 2019), 2019,
  • [2] Testing of support tools to detect plagiarism in academic Japanese texts
    Tolga Özşen
    İrem Saka
    Özgür Çelik
    Salim Razı
    Senem Çente Akkan
    Dita Henek Dlabolova
    Education and Information Technologies, 2023, 28 : 13287 - 13321
  • [3] Testing of support tools to detect plagiarism in academic Japanese texts
    Ozsen, Tolga
    Saka, Irem
    Celik, Ozgur
    Razi, Salim
    Akkan, Senem Cente
    Dlabolova, Dita Henek
    EDUCATION AND INFORMATION TECHNOLOGIES, 2023, 28 (10) : 13287 - 13321
  • [4] A Rough Set based Approach to Detect Plagiarism
    Bhavani, M.
    Reddy, K. Thammi
    Shashi, M.
    TENCON 2009 - 2009 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2009, : 1471 - +
  • [5] Texts Semantic Similarity Detection Based Graph Approach
    Mohebbi, Majid
    Talebpour, Alireza
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2016, 13 (02) : 246 - 251
  • [6] Prototype of Online Examination on MoLearn Applications Using Text Similarity to Detect Plagiarism
    Lemantara, Julianto
    Sunarto, M. J. Dewiyani
    Hariadi, Bambang
    Sagirani, Tri
    Amelia, Tan
    2018 5TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, COMPUTER, AND ELECTRICAL ENGINEERING (ICITACEE), 2018, : 131 - 136
  • [7] Uncovering highly obfuscated plagiarism cases using fuzzy semantic-based similarity model
    Alzahrani, Salha M.
    Salim, Naomie
    Palade, Vasile
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2015, 27 (03) : 248 - 268
  • [8] FTLM: A Fuzzy TOPSIS Language Modeling Approach for Plagiarism Severity Assessment
    Sharmila, P.
    Anbananthen, Kalaiarasi Sonai Muthu
    Gunasekaran, Nithyakala
    Balasubramaniam, Baarathi
    Chelliah, Deisy
    IEEE ACCESS, 2024, 12 : 122597 - 122608
  • [9] An approach to the use of word embeddings in a textual similarity task for Spanish texts
    Lopez-Solaz, Tomas
    Troyano, Jose A.
    Javier Ortega, F.
    Enriquez, Fernando
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2016, (57): : 67 - 74
  • [10] A multilingual fuzzy approach for classifying Twitter data using fuzzy logic and semantic similarity
    Madani, Youness
    Erritali, Mohammed
    Bengourram, Jamaa
    Sailhan, Francoise
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (12): : 8655 - 8673