Hybrid deep learning model for Arabic text classification based on mutual information

被引:9
作者
Abdulghani, Farah A. [1 ]
Abdullah, Nada A. Z. [1 ]
机构
[1] Univ Baghdad, Coll Sci, Dept Comp, Baghdad, Iraq
关键词
Arabic text classification; Deep learning; Mutual information; C-LSTM;
D O I
10.1080/02522667.2022.2060910
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Text categorization refers to the process of grouping text or documents into classes or categories according to their content, which is a significant task in natural language processing. The majority of the present work focused on English text, with a few experiments on Arabic text. The text classification process consists of many steps, from preprocessing documents (removing stop words and stem method), to feature extraction and classification phase. A new improved approach for Arabic text categorization was proposed using mutual information in a hybrid deep learning model for classification. To test the proposed model, two datasets of Arabic documents are employed. The experimental results demonstrate that employing the proposed mutual information exceeds other prior techniques in terms of performance. In Akhbarona corpus, the Multi-Layer Perceptron achieved a minimum accuracy of 96.09%, while the hybrid Convolution-Long Short-Term Memory had a performance level of 99.28%. In Khaleej corpus, the Gated Recurrent Unit had the maximum accuracy of 98.23%, while Multi-Layer Perceptron had the lowest accuracy of 97.23%
引用
收藏
页码:1901 / 1908
页数:8
相关论文
共 14 条
[1]  
Abdeen MAR, 2019, INT J ADV COMPUT SC, V10, P677
[2]   A Superior Arabic Text Categorization Deep Model (SATCDM) [J].
Alhawarat, M. ;
Aseeri, Ahmad O. .
IEEE ACCESS, 2020, 8 :24653-24661
[3]  
Ansari N., 2021, Iraqi Journal of Science, DOI DOI 10.24996/IJS.2021.62.8.32
[4]  
Bahassine S, 2016, INT CONF INTELL SYS
[5]   Arabic Text Classification Using Deep Learning Technics [J].
Boukil, Samir ;
Biniz, Mohamed ;
El Adnani, Fatiha ;
Cherrat, Loubna ;
El Moutaouakkil, Abd Elmaj Id .
INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2018, 11 (09) :103-114
[6]  
El-Alami F. -Z., 2016, PROC INT ARAB C INFO, P1
[7]  
Elnagar A, 2019, PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE AND SPEECH PROCESSING, ICNLSP 2019, P59
[8]   Arabic text classification using deep learning models [J].
Elnagar, Ashraf ;
Al-Debsi, Ridhwan ;
Einea, Omar .
INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (01)
[9]  
Galal M, 2019, J Theor Appl Inf Technol, V97, P3412
[10]   Efficient multi-cluster feature selection on text data [J].
Gupta, Ananya ;
Begum, Shahin Ara .
JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2019, 40 (08) :1583-1598