The Effect of using Light Stemming for Arabic Text Classification

被引:0
|
作者
Atwan, Jaffar [1 ]
Wedyan, Mohammad [2 ]
Bsoul, Qusay [3 ]
Hamadeen, Ahmad [4 ]
Alturki, Ryan [5 ]
Ikram, Mohammed [6 ]
机构
[1] Al Balqa Appl Univ, Dept Comp Informat Syst, Al Salt, Jordan
[2] Al Balqa Appl Univ, Fac Artificial Intelligence, Al Salt, Jordan
[3] Univ Sains Islam Malaysia, Fac Sci & Technol, Bandar Baru Nilai, Malaysia
[4] Al Balqa Appl Univ, Dept Comp Sci, Al Salt, Jordan
[5] Umm Al Qura Univ, Coll Comp & Informat Syst, Dept Informat Sci, Mecca, Saudi Arabia
[6] Umm Al Qura Univ, Univ Coll Al Jamoum, Comp Sci Dept, Mecca, Saudi Arabia
关键词
Arabic language; light stemming; information retrieval; Naive Bayes classification;
D O I
10.14569/IJACSA.2021.0120589
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Arabic is one of the Semitic languages in antiquity and one of the six official languages of the UN. Also, Arabic classification plays a significant and essential role in modern applications. There is a big difference between handling English text and Arabic text classification; preprocessing is also challenging for Arabic text. This paper presents the implementation of a Naive Bayes classifier for Arabic text with and without stemmer. A set of four categories and 800 documents were used from the Text Retrieval Conference (TREC) 2001 dataset. The results showed that Naive Bayes with light stemmer achieves better results than Naive Bayes without stemmer. The findings of the classifier accuracy by employing stemmer and without stemmer are as preprocessing. It reveals that the accuracy resulted from the light stemmer was better than the classifier without stemmer detection, which Naive Bayes Classification with light stemmer got 35.0745 higher than the Naive Bayes Classification 33.831% without stemmer. After contrasting them, the stemmer got better accuracy than the classifier.
引用
收藏
页码:768 / 773
页数:6
相关论文
共 50 条
  • [1] The Effect of Stemming on Arabic Text Classification: An Empirical Study
    Wahbeh, Abdullah
    Al-Kabi, Mohammed
    Al-Radaideh, Qasem
    Al-Shawakfa, Emad
    Alsmadi, Izzat
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2011, 1 (03) : 54 - 70
  • [2] Stemming versus light stemming as feature selection techniques for Arabic text categorization
    Duwairi, Rehab
    Al-Refai, Mohammad
    Khasawneh, Natheer
    2007 INNOVATIONS IN INFORMATION TECHNOLOGIES, VOLS 1 AND 2, 2007, : 199 - 203
  • [3] The Use of Stemming in the Arabic Text and Its Impact on the Accuracy of Classification
    Atwan, Jaffar
    Wedyan, Mohammad
    Bsoul, Qusay
    Hammadeen, Ahmad
    Alturki, Ryan
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [4] Effect of Stemming on Hindi Text Classification
    Pimpalshende, Anjusha
    Singh, Preety
    Potnurwar, Archana
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2023, 14 (01): : 208 - 215
  • [5] Arabic Text Stemming Using Query Expansion Method
    Yusuf, Nuhu
    Yunus, Mohd Amin Mohd
    Wahid, Norfaradilla
    EMERGING TRENDS IN INTELLIGENT COMPUTING AND INFORMATICS: DATA SCIENCE, INTELLIGENT INFORMATION SYSTEMS AND SMART COMPUTING, 2020, 1073 : 3 - 11
  • [6] Effect of stemming on text similarity for Arabic language at sentence level
    Alhawarat, Mohammad O.
    Abdeljaber, Hikmat
    Hilal, Anwer
    PEERJ COMPUTER SCIENCE, 2021,
  • [7] Effect of Stemming on Text Similarity for Arabic Language at Sentence Level
    Alhawarat M.O.
    Abdeljaber H.
    Hilal A.
    PeerJ Computer Science, 2021, 7 : 1 - 18
  • [8] Impact of stemming on Arabic text summarization
    Alami, Nabil
    Meknassi, Mohammed
    Ouatik, Said Alaoui
    Ennahnahi, NourEddine
    2016 4TH IEEE INTERNATIONAL COLLOQUIUM ON INFORMATION SCIENCE AND TECHNOLOGY (CIST), 2016, : 338 - 343
  • [9] Stemming Algorithm for Arabic Text Using a Parallel Data Processing
    Bougar, Marieme
    Ziyati, El Houssaine
    THIRD INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, 2019, 797 : 261 - 268
  • [10] Addressing Stemming Algorithm for Arabic Text Using Spark Over Hadoop
    Bougar, Marieme
    Ziyati, El Houssaine
    ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT (AI2SD'2019): VOL 1 - ADVANCED INTELLIGENT SYSTEMS FOR EDUCATION AND INTELLIGENT LEARNING SYSTEM, 2020, 1102 : 74 - 82