Keyphrase Extraction Based on Prior Knowledge

被引:8
作者
He, Guoxiu [1 ]
Fang, Junwei [1 ]
Cui, Haoran [1 ]
Wu, Chuan [1 ]
Lu, Wei [1 ]
机构
[1] Wuhan Univ, Sch Informat Management, Wuhan, Hubei, Peoples R China
来源
JCDL'18: PROCEEDINGS OF THE 18TH ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES | 2018年
基金
中国国家自然科学基金;
关键词
Keyphrase Extraction; Prior Knowledge; TF-IDF; TextRank; Supervised Learning Algorithm;
D O I
10.1145/3197026.3203869
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Keyphrase is an important way to quickly get the topic of a document by providing highly-summative information. The previous approaches for keyphrase extraction simply rank keyphrases according to statistics-based model or graph-based model, which ignore the influence of external knowledge. In this paper, we take prior knowledge, which contains controlled vocabulary of keyphrases and their prior probability, into consideration to enhance previous methods. First, we build a controlled vocabulary of keyphrases introduced by keyphrases from existing collections and a keyphrase candidate set is filtered from a given document by it. Then, we use prior probability to represent the importance of keyphrases candidate with TF-IDF and TextRank. Finally, a supervised learning algorithm is used to learn optimal weights of these three features. Experiments on four benchmark datasets show the great advantages of prior knowledge on keyphrase extraction. Furthermore, we achieve competitive performance compared with the state-of-the art methods.
引用
收藏
页码:341 / 342
页数:2
相关论文
共 7 条
[1]  
[Anonymous], 2009, Technical report
[2]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[3]  
Frank E, 1999, IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, P668
[4]  
Hulth A, 2003, PROCEEDINGS OF THE 2003 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, P216
[5]   Deep Keyphrase Generation [J].
Meng, Rui ;
Zhao, Sanqiang ;
Han, Shuguang ;
He, Daqing ;
Brusilovsky, Peter ;
Chi, Yu .
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, :582-592
[6]  
Mihalcea R., 2004, EMNLP 2004
[7]  
Nguyen TD, 2007, LECT NOTES COMPUT SC, V4822, P317