Comparative Study of Arabic Stemming Algorithms for Topic Identification

Cited by: 7
Authors
Naili, Marwa [1]
Chaibi, Anja Habacha [1]
Ben Ghezala, Henda Hajjami [1]
Affiliations
[1] Univ Manouba, Natl Sch Comp Sci ENSI, RIADI Lab, Manouba 2010, Tunisia
Source
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES 2019) | 2019, Vol. 159
Keywords
Arabic stemming algorithms; LDA; Topic identification
DOI
10.1016/j.procs.2019.09.238
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Stemming is an important pre-processing step in natural language processing tasks such as text mining and information retrieval. However, how difficult stemming is to carry out depends on the language being processed. In particular, the complex morphology of Arabic can affect stemming results. Several algorithms have therefore been proposed to overcome these stemming problems. In this paper, we investigate different stemming algorithms through a comparative study in the field of Arabic topic identification. © 2019 The Authors. Published by Elsevier B.V.
Pages: 794-802
Page count: 9