Question Headline Generation for News Articles

被引:16
作者
Zhang, Ruqing [1 ,2 ]
Guo, Jiafeng [1 ,2 ]
Fan, Yixing [1 ,2 ]
Lan, Yanyan [1 ,2 ]
Xu, Jun [1 ,2 ]
Cao, Huanhuan [3 ]
Cheng, Xueqi [1 ,2 ]
机构
[1] Univ Chinese Acad Sci, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, CAS Key Lab Network Data Sci & Technol, Beijing, Peoples R China
[3] ByteDance Inc, Beijing, Peoples R China
来源
CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT | 2018年
基金
中国国家自然科学基金;
关键词
Question headline generation; self-attention mechanism;
D O I
10.1145/3269206.3271711
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we introduce and tackle the Question Headline Generation (QHG) task. The motivation comes from the investigation of a real-world news portal where we find that news articles with question headlines often receive much higher click-through ratio than those with non-question headlines. The QHG task can be viewed as a specific form of the Question Generation (QG) task, with the emphasis on creating a natural question from a given news article by taking the entire article as the answer. A good QHG model thus should be able to generate a question by summarizing the essential topics of an article. Based on this idea, we propose a novel dual-attention sequence-to-sequence model (DASeq2Seq) for the QHG task. Unlike traditional sequence-to-sequence models which only employ the attention mechanism in the decoding phase for better generation, our DASeq2Seq further introduces a self-attention mechanism in the encoding phase to help generate a good summary of the article. We investigate two ways of the self-attention mechanism, namely global self-attention and distributed self-attention. Besides, we employ a vocabulary gate over both generic and question vocabularies to better capture the question patterns. Through the offline experiments, we show that our approach can significantly outperform the state-of-the-art question generation or headline generation models. Furthermore, we also conduct online evaluation to demonstrate the effectiveness of our approach using A/B test.
引用
收藏
页码:617 / 626
页数:10
相关论文
共 54 条
  • [31] Heilman M., 2010, HUMAN LANGUAGE TECHN, P609
  • [32] Jing HY, 1999, SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, P129, DOI 10.1145/312624.312666
  • [33] Kingma D. P., 2015, 3 INT C LEARN REPR I, P1
  • [34] What makes you click? The effect of question headlines on readership in computer-mediated communication
    Lai, Linda
    Farbrot, Audun
    [J]. SOCIAL INFLUENCE, 2014, 9 (04) : 289 - 299
  • [35] Lin Z., 2017, 5 INT C LEARNING REP
  • [36] Lindberg D, 2013, P 14 EUR WORKSH NAT, P105
  • [37] Lindberg D., 2013, GENERATING NATURAL L
  • [38] THE AUTOMATIC CREATION OF LITERATURE ABSTRACTS
    LUHN, HP
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1958, 2 (02) : 159 - 165
  • [39] Mannem Prashanth, 2010, Proceedings of QG2010: The Third Workshop on Question Generation, P84
  • [40] IMPROVEMENT OF AUTOMATIC ABSTRACTS BY USE OF STRUCTURAL-ANALYSIS
    MATHIS, BA
    RUSH, JE
    YOUNG, CE
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1973, 24 (02): : 101 - 109