Extractive Multi-Document Summarization: A Review of Progress in the Last Decade

被引:6
作者
Jalil, Zakia [1 ]
Nasir, Jamal Abdul [1 ]
Nasir, Muhammad [1 ]
机构
[1] Int Islamic Univ, Dept Comp Sci & Software Engn, Islamabad 44000, Pakistan
关键词
Semantics; Ontologies; Redundancy; Data mining; Task analysis; Natural language processing; Licenses; Abstractive summarization; clustering; extractive summarization; graph-based; machine learning; multi-document summarization; natural language processing; ontology; term-based; DIFFERENTIAL EVOLUTION; ARCHETYPAL ANALYSIS; MAXIMUM COVERAGE; TEXT; GRAPH; FRAMEWORK; REDUNDANCY; ALGORITHM; RELEVANCE; SEARCH;
D O I
10.1109/ACCESS.2021.3112496
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the tremendous growth in the number of electronic documents, it is becoming challenging to manage the volume of information. Much research has focused on automatically summarizing the information available in the documents. Multi-Document Summarization (MDS) is one approach that aims to extract the information from the available documents in such a concise way that none of the important points are missed from the summary while avoiding the redundancy of information at the same time. This study presents an extensive survey of extractive MDS over the last decade to show the progress of research in this field. We present different techniques of extractive MDS and compare their strengths and weaknesses. Research work is presented by category and evaluated to help the reader understand the work in this field and to guide them in defining their own research directions. Benchmark datasets and standard evaluation techniques are also presented. This study concludes that most of the extractive MDS techniques are successful in developing salient and information-rich summaries of the documents provided.
引用
收藏
页码:130928 / 130946
页数:19
相关论文
共 80 条
[61]  
Tzouridis Emmanouil., 2014, COLING 2014, 25th International Conference on Computational Linguistics, Proceedings of the Conference: Technical Papers, August 23-29, 2014, Dublin, Ireland, P1636
[62]   MCRMR: Maximum coverage and relevancy with minimal redundancy based multi-document summarization [J].
Verma, Pradeepika ;
Om, Hari .
EXPERT SYSTEMS WITH APPLICATIONS, 2019, 120 :43-56
[63]  
Wan Xiaojun, 2008, EMNLP, P755
[64]  
Wang BY, 2017, CAAI T INTELL TECHNO, V2, P26, DOI 10.1016/j.trit.2016.12.005
[65]  
Wang Dengting, 2009, Proceedings of the 5th International Conference on Asian and Pacific Coasts. APAC 2009, P297, DOI 10.1142/9789814287951_0129
[66]   Weighted consensus multi-document summarization [J].
Wang, Dingding ;
Li, Tao .
INFORMATION PROCESSING & MANAGEMENT, 2012, 48 (03) :513-523
[67]   Integrating Document Clustering and Multidocument Summarization [J].
Wang, Dingding ;
Zhu, Shenghuo ;
Li, Tao ;
Chi, Yun ;
Gong, Yihong .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2011, 5 (03)
[68]   A Cross-Layer Design of Channel Assignment and Routing in Cognitive Radio Networks [J].
Wang, Jiao ;
Huang, Yuqing .
PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (ICCSIT 2010), VOL 7, 2010, :542-547
[69]   A document-sensitive graph model for multi-document summarization [J].
Wei, Furu ;
Li, Wenjie ;
Lu, Qin ;
He, Yanxiang .
KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 22 (02) :245-259
[70]   Ontology-enriched multi-document summarization in disaster management using submodular function [J].
Wu, Keshou ;
Li, Lei ;
Li, Jingxuan ;
Li, Tao .
INFORMATION SCIENCES, 2013, 224 :118-129