An Integrated Approach for Keyphrase Generation via Exploring the Power of Retrieval and Extraction

被引:0
作者
Chen, Wang [1 ]
Chan, Hou Pong [1 ]
Li, Piji
Bing, Lidong [2 ]
King, Irwin [1 ]
机构
[1] Chinese Univ Hong Kong, Shatin, Hong Kong, Peoples R China
[2] Alibaba DAMO Acad, R&D Ctr Singapore, Machine Intelligence Technol, Hangzhou, Peoples R China
来源
2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1 | 2019年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a novel integrated approach for keyphrase generation (KG). Unlike previous works which are purely extractive or generative, we first propose a new multitask learning framework that jointly learns an extractive model and a generative model. Besides extracting keyphrases, the output of the extractive model is also employed to rectify the copy probability distribution of the generative model, such that the generative model can better identify important contents from the given document. Moreover, we retrieve similar documents with the given document from training data and use their associated keyphrases as external knowledge for the generative model to produce more accurate keyphrases. For further exploiting the power of extraction and retrieval, we propose a neural-based merging module to combine and re-rank the predicted keyphrases from the enhanced generative model, the extractive model, and the retrieved keyphrases. Experiments on the five KG benchmarks demonstrate that our integrated approach outperforms the state-of-the-art methods.
引用
收藏
页码:2846 / 2856
页数:11
相关论文
共 38 条
  • [1] [Anonymous], 2006, ACL 2006
  • [2] [Anonymous], 2010, P 23 AAAI C ART INT, DOI DOI 10.1145/1740592.1740596
  • [3] [Anonymous], 2011, P 15 C COMP NAT LANG
  • [4] Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
  • [5] Berend G., 2011, P 5 INT JOINT C NATU, P1162
  • [6] Cao ZQ, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, P152
  • [7] Chen J, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P4057
  • [8] Chen Wang, 2018, ABS180808575 CORR
  • [9] Cho K, 2014, ARXIV14061078
  • [10] Frank E, 1999, IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, P668