Enhance Content Selection for Multi-Document Summarization with Entailment Relation

Cited by: 4
Authors
Wang, Yu-Yun [1]
Wu, Jhen-Yi [1]
Chou, Tzu-Hsuan [1]
Lin, Ying-Jia [1]
Kao, Hung-Yu [1]
Affiliations
[1] National Cheng Kung University, Department of Computer Science and Information Engineering, Tainan, Taiwan
Source
2020 25th International Conference on Technologies and Applications of Artificial Intelligence (TAAI 2020) | 2020
Keywords
abstractive summarization; entailment relation; multi-document summarization
DOI
10.1109/TAAI51410.2020.00030
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Automatic text summarization is a common task in natural language processing. The goal is to generate a shorter version of the original text that retains the relevant information. This paper studies multi-document summarization (MDS) applied to news articles. MDS has two significant issues: information overlap and information differences among multiple articles. Existing models mostly approach MDS from the perspective of single-document summarization (SDS) and do not consider the relations between sentences across multiple news articles. Our proposed method addresses this issue and consists of two models. The sentence selector model selects representative sentences based on the entailment relation across different articles. The content is related to the event of the article extracted through the algorithm. The summary generator model generates a final summary, ensuring that it contains no redundancy and retains vital information. Experimental results show that our proposed model improves the evaluation results. The main contribution of our approach is the use of the entailment relation to obtain key content from multiple articles. Adding semantic comprehension helps identify salient information clearly and improves the accuracy of MDS.
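Illustrative sketch (not from the paper): the abstract does not specify how the sentence selector scores entailment, so the Python below is only a minimal sketch of the general idea of picking sentences that are supported by other articles while skipping redundant ones. The function names `entailment_score` and `select_sentences`, the threshold, and the token-overlap scoring are all assumptions made for illustration; a real system would presumably replace the toy scorer with a trained natural language inference model.

```python
import re
from typing import List, Set


def tokens(sentence: str) -> Set[str]:
    """Lowercase word tokens of a sentence."""
    return set(re.findall(r"[a-z0-9]+", sentence.lower()))


def entailment_score(premise: str, hypothesis: str) -> float:
    """Toy stand-in for an entailment model (an assumption, not the paper's
    scorer): the fraction of hypothesis tokens that also occur in the premise."""
    hyp = tokens(hypothesis)
    if not hyp:
        return 0.0
    return len(hyp & tokens(premise)) / len(hyp)


def select_sentences(docs: List[List[str]], k: int = 2,
                     threshold: float = 0.6) -> List[str]:
    """Pick up to k sentences, preferring those entailed by other documents."""
    candidates = []
    for i, doc in enumerate(docs):
        for s_idx, sent in enumerate(doc):
            # Support = number of OTHER documents containing a sentence
            # whose score against this sentence reaches the threshold.
            support = sum(
                1
                for j, other in enumerate(docs)
                if j != i
                and any(entailment_score(o, sent) >= threshold for o in other)
            )
            candidates.append((support, i, s_idx, sent))

    # Highest support first; ties keep the original document/sentence order.
    candidates.sort(key=lambda c: (-c[0], c[1], c[2]))

    selected: List[str] = []
    for _, _, _, sent in candidates:
        if len(selected) == k:
            break
        # Skip a sentence already covered by one we kept (redundancy check).
        if any(entailment_score(kept, sent) >= threshold for kept in selected):
            continue
        selected.append(sent)
    return selected
```

In this sketch the selected sentences would then be passed to a separate summary generator, which is not modeled here.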
Pages: 119-124
Page count: 6