AKEA: An Arabic Keyphrase Extraction Algorithm

被引:6
作者
Amer, Eslam [1 ]
Foad, Khaled [2 ]
机构
[1] Banha Univ, Fac Comp & Informat, Dept Comp Sci, Banha, Egypt
[2] Banha Univ, Fac Comp & Informat, Dept Informat Syst, Banha, Egypt
来源
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016 | 2017年 / 533卷
关键词
Keyphrase extraction; Natural language processing; SYSTEM;
D O I
10.1007/978-3-319-48308-5_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Keyphrase extraction is a critical step in many natural language processing and Information retrieval applications. In this paper, we introduce AKEA, a keyphrase extraction algorithm for single Arabic documents. AKEA is an unsupervised algorithm as it does not need any type of training in order to achieve its task. We rely on heuristics that collaborate linguistic patterns based on Part-Of-Speech (POS) tags, statistical knowledge, and the internal structural pattern of terms (i.e. word-occurrence). We employ the usage of Arabic Wikipedia to improve the ranking (or significance) of candidate keyphrases by adding a confidence score if the candidate exist as an indexed Wikipedia concept. Experimental results show that on average AKEA has the highest precision value, the highest F-measure value which indicates it presents more accurate results compared to its other algorithms
引用
收藏
页码:137 / 146
页数:10
相关论文
共 27 条
[1]   Automatic Arabic text summarization: a survey [J].
Al-Saleh, Asma Bader ;
Menai, Mohamed El Bachir .
ARTIFICIAL INTELLIGENCE REVIEW, 2016, 45 (02) :203-234
[2]  
Babekr S., 2013, INT J ADV COMPUT SCI, V4
[3]   An Extended Keyword Extraction Method [J].
Bao Hong ;
Deng Zhen .
INTERNATIONAL CONFERENCE ON APPLIED PHYSICS AND INDUSTRIAL ENGINEERING 2012, PT B, 2012, 24 :1120-1127
[4]   Novel Word Features for Keyword Extraction [J].
Chen, Yiqun ;
Yin, Jian ;
Zhu, Weiheng ;
Qiu, Shiding .
WEB-AGE INFORMATION MANAGEMENT (WAIM 2015), 2015, 9098 :148-160
[5]   Efficient kNN classification algorithm for big data [J].
Deng, Zhenyun ;
Zhu, Xiaoshu ;
Cheng, Debo ;
Zong, Ming ;
Zhang, Shichao .
NEUROCOMPUTING, 2016, 195 :143-148
[6]   KP-Miner: A keyphrase extraction system for English and Arabic documents [J].
El-Beltagy, Samhaa R. ;
Rafea, Ahmed .
INFORMATION SYSTEMS, 2009, 34 (01) :132-144
[7]  
El-Ghannam Fatma, 2013, International Journal of Computer Science & Information Technology, V5, P77, DOI 10.5121/ijcsit.2013.5606
[8]  
Fouad K., 2012, INT J COMPUT SCI ISS, V9, P3
[9]  
Fouad K., 2013, INT J COMPUT APPL, V62, P10
[10]  
Gupta Vishal, 2009, Journal of Emerging Technologies in Web Intelligence, V1, P60, DOI 10.4304/jetwi.1.1.60-76