Regularizing Output Distribution of Abstractive Chinese Social Media Text Summarization for Improved Semantic Consistency

被引:8
作者
Wei, Bingzhen [1 ]
Ren, Xuancheng [1 ]
Zhang, Yi [1 ]
Cai, Xiaoyan [2 ]
Su, Qi [3 ]
Sun, Xu [4 ,5 ,6 ]
机构
[1] Peking Univ, Sch Elect Engn & Comp Sci, MOE Key Lab Computat Linguist, Beijing 100871, Peoples R China
[2] Northwestern Polytech Univ, Sch Automat, Xian 710072, Shaanxi, Peoples R China
[3] Peking Univ, Sch Foreign Languages, Beijing 100871, Peoples R China
[4] Peking Univ, Sch EECS, MOE Key Lab Computat Linguist, Beijing 100871, Peoples R China
[5] Peking Univ, Ctr Data Sci, Beijing 100871, Peoples R China
[6] Natl Engn Lab Big Data Anal Technol & Applicat PK, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
Abstractive text summarization; semantic consistency; Chinese social media text; natural language processing; AGREEMENT;
D O I
10.1145/3314934
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Abstractive text summarization is a highly difficult problem, and the sequence-to-sequence model has shown success in improving the performance on the task. However, the generated summaries are often inconsistent with the source content in semantics. In such cases, when generating summaries, the model selects semantically unrelated words with respect to the source content as the most probable output. The problem can be attributed to heuristically constructed training data, where summaries can be unrelated to the source content, thus containing semantically unrelated words and spurious word correspondence. In this article, we propose a regularization approach for the sequence-to-sequence model and make use of what the model has learned to regularize the learning objective to alleviate the effect of the problem. In addition, we propose a practical human evaluation method to address the problem that the existing automatic evaluation method does not evaluate the semantic consistency with the source content properly. Experimental results demonstrate the effectiveness of the proposed approach, which outperforms almost all the existing models. Especially, the proposed approach improves the semantic consistency by 4% in terms of human evaluation.
引用
收藏
页数:15
相关论文
共 29 条
[1]  
Aghajanyan Armen., 2017, Proceedings of the International Symposium on Robotics Research (ISRR), P1
[2]  
[Anonymous], 2017, P 2017 C EMPIRICAL M, DOI 10.18653/v1/D17-1221
[3]  
[Anonymous], 2014, ARXIV
[4]  
[Anonymous], 2016, 25 INT JOINT C ART I
[5]  
Cao ZQ, 2018, AAAI CONF ARTIF INTE, P4784
[6]  
Chopra S., 2016, P 2016 C N AM CHAPTE, P93
[7]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46
[8]  
Gu JT, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P1631
[9]  
Hinton G., 2015, STAT, V1050, P9
[10]  
Hochreiter S, 1997, Neural Computation, V9, P1735