Zero-Shot Cross-Lingual Neural Headline Generation

被引:39
作者
Ayana [1 ,2 ]
Shen, Shi-qi [1 ]
Chen, Yun [3 ]
Yang, Cheng [1 ]
Liu, Zhi-yuan [1 ,4 ]
Sun, Mao-song [1 ,4 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Tsinghua Natl Lab Informat Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
[2] Inner Mongolia Univ Finance & Econ, Dept Comp Informat Management, Hohhot 010051, Peoples R China
[3] Univ Hong Kong, Dept Elect & Elect Engn, Hong Kong, Hong Kong, Peoples R China
[4] Heilongjiang Univ, Sch Comp Sci & Technol, Harbin 150080, Heilongjiang, Peoples R China
关键词
Neural network; headline generation; cross-lingual;
D O I
10.1109/TASLP.2018.2842432
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Neural headline generation (NHG) has been proven to be effective in generating a fully abstractive headline recently. Existing NHG systems are only capable of producing headline of the same language as the original document. Cross lingual headline generation is an important task since it provides an efficient way to understand the key point of a document in a different language. Due to the lack of those parallel corpora of direct source language articles and target language headlines, we propose to deal with the cross-lingual neural headline generation (CNHG) under the zero-shot scenario. A trivial solution is to translate and summarize the source document in a pipeline way. However, a pipeline solution will lead to error propagation in the translation and summarization phases. This challenge motivates us to build a direct source-to-target CNHG model based on existing parallel corpora of translation and monolingual headline generation. Specifically, we let a parameterized CNHG model (student model) mimic the output of a pretrained translation or headline generation model (teacher model). To the best of our knowledge, this is the first effort to address CNHG problem. Besides, we construct English-Chinese headline generation evaluation datasets by manual translation. Experimental results on English-to-Chinese cross-lingual headline generation demonstrate that our proposed method significantly outperforms the baseline models.
引用
收藏
页码:2319 / 2327
页数:9
相关论文
共 33 条
[1]  
[Anonymous], 2012, CoRR
[2]  
[Anonymous], 2016, P 2016 EMNLP
[3]  
[Anonymous], 2015, PROC 2015 C EMPIRICA
[4]  
[Anonymous], 2015, NeurIPS
[5]   Recent Advances on Neural Headline Generation [J].
Ayana ;
Shen, Shi-Qi ;
Lin, Yan-Kai ;
Tu, Cun-Chao ;
Zhao, Yu ;
Liu, Zhi-Yuan ;
Sun, Mao-Song .
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2017, 32 (04) :768-784
[6]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[7]  
Chen H., 2016, P C EMP METH NAT LAN, P1650, DOI DOI 10.18653/V1/D16-1171
[8]   A Teacher-Student Framework for Zero-Resource Neural Machine Translation [J].
Chen, Yun ;
Liu, Yang ;
Cheng, Yong ;
Li, Victor O. K. .
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, :1925-1935
[9]  
Cho K., 2014, ARXIV14061078, P1724, DOI 10.3115/V1/D14-1179
[10]  
Chopra S., 2016, P 2016 C N AM CHAPT, P93, DOI DOI 10.18653/V1/N16-1012