Bi-LSTM-CRF Sequence Labeling for Keyphrase Extraction from Scholarly Documents

被引:109
作者
Al-Zaidy, Rabah A. [1 ]
Caragea, Cornelia [2 ]
Giles, C. Lee [1 ]
机构
[1] Penn State Univ, University Pk, PA 16802 USA
[2] Univ Illinois, Chicago, IL 60680 USA
来源
WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019) | 2019年
关键词
Keyphrase extraction; sequence labeling; deep learning;
D O I
10.1145/3308558.3313642
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we address the keyphrase extraction problem as sequence labeling and propose a model that jointly exploits the complementary strengths of Conditional Random Fields that capture label dependencies through a transition parameter matrix consisting of the transition probabilities from one label to the neighboring label, and Bidirectional Long Short Term Memory networks that capture hidden semantics in text through the long distance dependencies. Our results on three datasets of scholarly documents show that the proposed model substantially outperforms strong baselines and previous approaches for keyphrase extraction.
引用
收藏
页码:2551 / 2557
页数:7
相关论文
共 46 条
[1]  
Adar E, 2015, PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, P606
[2]   Multi-Task Learning of Keyphrase Boundary Classification [J].
Augenstein, Isabelle ;
Sogaard, Anders .
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, :341-346
[3]  
Barker Ken, 2000, USING NOUN SE HEADS, P40
[4]  
Bhaskar P., 2012, P COLING, P17
[5]   A Comparison of Supervised Keyphrase Extraction Models [J].
Bulgarov, Florin ;
Caragea, Cornelia .
WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2015, :13-14
[6]  
Caragea C., 2014, P 2014 C EMP METH NA, P1435, DOI [DOI 10.3115/V1/D14-1150, 10.3115/v1/d14-1150]
[7]  
Cho K, 2014, ARXIV14061078
[8]  
Danesh S., 2015, P 4 JOINT C LEX COMP P 4 JOINT C LEXICAL, P117
[9]  
Das Gollapalli S, 2014, AAAI CONF ARTIF INTE, P1629
[10]  
El-Beltagy S.R., 2010, 5 INT WORKSH SEM EV, P190