Automatic keyphrase extraction: a survey and trends

被引:0
作者
Zakariae Alami Merrouni
Bouchra Frikh
Brahim Ouhbi
机构
[1] Sidi Mohamed Ben Abdellah University,TTI Laboratory, Higher School of Technology (EST)
[2] Moulay Ismail University (UMI),Mathematical Modeling & Computer Laboratory (LM2I), National Higher School of Arts and Crafts (ENSAM)
来源
Journal of Intelligent Information Systems | 2020年 / 54卷
关键词
Information retrieval; Natural language processing; Text mining; Automatic keyphrase extraction; Supervised approaches; Unsupervised approaches; Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
Due to the exponential growth of textual data and web sources, an automatic mechanism is required to identify relevant information embedded within them. The utility of Automatic Keyphrase Extraction (AKPE) cannot be overstated, given its widespread adoption in many Information Retrieval (IR), Natural Language Processing (NLP) and Text Mining (TM) applications, and its potential ability to solve difficulties related to extracting valuable information. In recent years, a wide range of AKPE techniques have been proposed. However, they are still impaired by low accuracy rates and moderate performance. This paper provides a comprehensive review of recent research efforts on the AKPE task and its related techniques. More concretely, we highlight the common process of this task, while also illustrating the various approaches used (supervised, unsupervised, and Deep Learning) and released techniques. We investigate the major challenges that such techniques face and depict the specific complexities they address. Besides, we provide a comparison study of the best performing techniques, discuss why some perform better than others and propose recommendations to improve each stage of the AKPE process.
引用
收藏
页码:391 / 424
页数:33
相关论文
共 70 条
[1]  
Blei DM(2003)Latent dirichlet allocation Journal of Machine Learning Research 3 993-1022
[2]  
Ng AY(1998)The anatomy of a large-scale hypertextual web search engine Computer Networks and ISDN Systems 30 107-117
[3]  
Jordan MI(2006)System and method for query refinement to enable improved searching based on identifying and utilizing popular concepts related to users’ queries US Patent 7 136,845-185
[4]  
Brin S(2015)Latent keyphrase extraction using LDA model Journal of Korean Institute of Intelligent Systems 25 180-144
[5]  
Page L(2009)KP-MINER: A keyphrase extraction system for English and Arabic documents Information Systems 34 132-211
[6]  
Chandrasekar R(1990)Finding structure in time Cognitive science 14 179-418
[7]  
James CFI(2010)Detection of access to terror-related web sites using an advanced terror detection system (ATDS) Journal of the association for information science and technology 61 405-1170
[8]  
Watson EB(2011)A new methodology for domain ontology construction from the Web International Journal on Artificial Intelligence Tools 20 1157-132
[9]  
Cho T(2009)Improving keyword based web image search with visual feature distribution and term expansion Knowledge and Information Systems 21 113-104
[10]  
Lee JH(1999)Improving browsing in digital libraries with keyphrase indexes Decision Support Systems 27 81-500