Extractive Multi-Document Summarization: A Review of Progress in the Last Decade

被引:5
作者
Jalil, Zakia [1 ]
Nasir, Jamal Abdul [1 ]
Nasir, Muhammad [1 ]
机构
[1] Int Islamic Univ, Dept Comp Sci & Software Engn, Islamabad 44000, Pakistan
关键词
Semantics; Ontologies; Redundancy; Data mining; Task analysis; Natural language processing; Licenses; Abstractive summarization; clustering; extractive summarization; graph-based; machine learning; multi-document summarization; natural language processing; ontology; term-based; DIFFERENTIAL EVOLUTION; ARCHETYPAL ANALYSIS; MAXIMUM COVERAGE; TEXT; GRAPH; FRAMEWORK; REDUNDANCY; ALGORITHM; RELEVANCE; SEARCH;
D O I
10.1109/ACCESS.2021.3112496
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the tremendous growth in the number of electronic documents, it is becoming challenging to manage the volume of information. Much research has focused on automatically summarizing the information available in the documents. Multi-Document Summarization (MDS) is one approach that aims to extract the information from the available documents in such a concise way that none of the important points are missed from the summary while avoiding the redundancy of information at the same time. This study presents an extensive survey of extractive MDS over the last decade to show the progress of research in this field. We present different techniques of extractive MDS and compare their strengths and weaknesses. Research work is presented by category and evaluated to help the reader understand the work in this field and to guide them in defining their own research directions. Benchmark datasets and standard evaluation techniques are also presented. This study concludes that most of the extractive MDS techniques are successful in developing salient and information-rich summaries of the documents provided.
引用
收藏
页码:130928 / 130946
页数:19
相关论文
共 50 条
  • [41] A Hybrid Solution To Abstractive Multi-Document Summarization Using Supervised and Unsupervised Learning
    Bhagchandani, Gaurav
    Bodra, Deep
    Gangan, Abhishek
    Mulla, Nikahat
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), 2019, : 566 - 570
  • [42] An Improved LDA Multi-Document Summarization Model Based on TensorFlow
    Zhong, Ying
    Tang, Zhuo
    Ding, Xiaofei
    Zhu, Li
    Le, Yuquan
    Li, Kenli
    Li, Keqin
    2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 255 - 259
  • [43] A multi-document summarization system based on statistics and linguistic treatment
    Ferreira, Rafael
    Cabral, Luciano de Souza
    Freitas, Frederico
    Lins, Rafael Dueire
    Silva, Gabriel de Franca
    Simske, Steven J.
    Favaro, Luciano
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (13) : 5780 - 5787
  • [44] Multi-Document Summarization for Turkish News
    Demirci, Ferhat
    Karabudak, Engin
    Ilgen, Bahar
    2017 INTERNATIONAL ARTIFICIAL INTELLIGENCE AND DATA PROCESSING SYMPOSIUM (IDAP), 2017,
  • [45] Weighted consensus multi-document summarization
    Wang, Dingding
    Li, Tao
    INFORMATION PROCESSING & MANAGEMENT, 2012, 48 (03) : 513 - 523
  • [46] Caviar-Sunflower Optimization Algorithm-Based Deep Learning Classifier for Multi-Document Summarization
    Sheela, J.
    Janet, B.
    COMPUTER JOURNAL, 2023, 66 (03) : 727 - 742
  • [47] Multi-document summarization via submodularity
    Li, Jingxuan
    Li, Lei
    Li, Tao
    APPLIED INTELLIGENCE, 2012, 37 (03) : 420 - 430
  • [48] Multi-document summarization via submodularity
    Jingxuan Li
    Lei Li
    Tao Li
    Applied Intelligence, 2012, 37 : 420 - 430
  • [49] Multi-Document Summarization by Information Distance
    Long, Chong
    Huang, Minlie
    Zhu, Xiaoyan
    Li, Ming
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 866 - +
  • [50] MULTI-DOCUMENT SUMMARIZATION SYSTEMS COMPARISON
    Li, Lei
    Heng, Wei
    Liu, Ping'an
    2012 IEEE 2nd International Conference on Cloud Computing and Intelligent Systems (CCIS) Vols 1-3, 2012, : 1409 - 1413