Multi-Label Arabic Text Classification: An Overview

被引:0
作者
Aljedani, Nawal [1 ]
Alotaibi, Reem [1 ]
Taileb, Mounira [1 ]
机构
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah, Saudi Arabia
关键词
Machine learning; text classification; multi-label classification; Arabic natural language processing; hierarchical classification; Lexicon approach; ALGORITHMS; TREES;
D O I
10.14569/IJACSA.2020.0111086
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
There is a massive growth of text documents on the web. This led to the increasing need for methods that can organize and classify electronic documents (instances) automatically. Multi-label classification task is widely used in real-world problems and it has been applied on different applications. It assigns multiple labels for each document simultaneously. Few and insufficient research studies have investigated the multi-label text classification problem in the Arabic language. Therefore, this survey paper aims to present an extensive review of the existing multi-label classification methods and techniques that can deal with multi-label problem. Besides, we focus on Arabic language by covering the relevant applications of multi-label classification on the Arabic text, and identify the main challenges faced by these studies. Furthermore, this survey presents an experimental comparisons of different multi-label classification methods applied for the Arabic context and points out some baseline results. We found that further investigations are also needed to improve the multi-label classification task in the Arabic language, especially the hierarchical classification task.
引用
收藏
页码:694 / 706
页数:13
相关论文
共 46 条
[21]   A lexicon based approach for classifying Arabic multi-labeled text [J].
Hmeidi, Ismail ;
Al-Ayyoub, Mahmoud ;
Mahyoub, Nizar A. ;
Shehab, Mohammed A. .
INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2016, 12 (04) :504-532
[22]   Label ranking by learning pairwise preferences [J].
Huellermeier, Eyke ;
Fuernkranz, Johannes ;
Cheng, Weiwei ;
Brinker, Klaus .
ARTIFICIAL INTELLIGENCE, 2008, 172 (16-17) :1897-1916
[23]   Binary relevance efficacy for multilabel classification [J].
Luaces, Oscar ;
Diez, Jorge ;
Barranquero, Jose ;
Jose del Coz, Juan ;
Bahamonde, Antonio .
PROGRESS IN ARTIFICIAL INTELLIGENCE, 2012, 1 (04) :303-313
[24]   Web Genre Classification via Hierarchical Multi-label Classification [J].
Madjarov, Gjorgji ;
Vidulin, Vedrana ;
Dimitrovski, Ivica ;
Kocev, Dragi .
INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2015, 2015, 9375 :9-17
[25]  
Mubarak H., 2014, P EMNLP 2014 WORKSHO, P1, DOI DOI 10.3115/V1/W14-3601
[26]   Classifier chains for multi-label classification [J].
Read, Jesse ;
Pfahringer, Bernhard ;
Holmes, Geoff ;
Frank, Eibe .
MACHINE LEARNING, 2011, 85 (03) :333-359
[27]   Multi-label Classification using Ensembles of Pruned Sets [J].
Read, Jesse ;
Pfahringer, Bernhard ;
Holmes, Geoff .
ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, :995-1000
[28]   Improved boosting algorithms using confidence-rated predictions [J].
Schapire, RE ;
Singer, Y .
MACHINE LEARNING, 1999, 37 (03) :297-336
[29]  
Shehab MA, 2016, INT CONF COMP SCI
[30]   A survey of hierarchical classification across different application domains [J].
Silla, Carlos N., Jr. ;
Freitas, Alex A. .
DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 22 (1-2) :31-72