Extractive Multi-Document Summarization: A Review of Progress in the Last Decade

被引:5
作者
Jalil, Zakia [1 ]
Nasir, Jamal Abdul [1 ]
Nasir, Muhammad [1 ]
机构
[1] Int Islamic Univ, Dept Comp Sci & Software Engn, Islamabad 44000, Pakistan
关键词
Semantics; Ontologies; Redundancy; Data mining; Task analysis; Natural language processing; Licenses; Abstractive summarization; clustering; extractive summarization; graph-based; machine learning; multi-document summarization; natural language processing; ontology; term-based; DIFFERENTIAL EVOLUTION; ARCHETYPAL ANALYSIS; MAXIMUM COVERAGE; TEXT; GRAPH; FRAMEWORK; REDUNDANCY; ALGORITHM; RELEVANCE; SEARCH;
D O I
10.1109/ACCESS.2021.3112496
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the tremendous growth in the number of electronic documents, it is becoming challenging to manage the volume of information. Much research has focused on automatically summarizing the information available in the documents. Multi-Document Summarization (MDS) is one approach that aims to extract the information from the available documents in such a concise way that none of the important points are missed from the summary while avoiding the redundancy of information at the same time. This study presents an extensive survey of extractive MDS over the last decade to show the progress of research in this field. We present different techniques of extractive MDS and compare their strengths and weaknesses. Research work is presented by category and evaluated to help the reader understand the work in this field and to guide them in defining their own research directions. Benchmark datasets and standard evaluation techniques are also presented. This study concludes that most of the extractive MDS techniques are successful in developing salient and information-rich summaries of the documents provided.
引用
收藏
页码:130928 / 130946
页数:19
相关论文
共 50 条
  • [31] A Game Theory Approach for Multi-document Summarization
    Ahmad, Amreen
    Ahmad, Tanvir
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (04) : 3655 - 3667
  • [32] Multi-document summarization based on unsupervised clustering
    Ji, Paul
    INFORMATION RETRIEVAL TECHNOLOLGY, PROCEEDINGS, 2006, 4182 : 560 - 566
  • [33] Decomposition-based multi-objective differential evolution for extractive multi-document automatic text summarization
    Wahab, Muhammad Hafizul Hazmi
    Hamid, Nor Asilah Wati Abdul
    Subramaniam, Shamala
    Latip, Rohaya
    Othman, Mohamed
    APPLIED SOFT COMPUTING, 2024, 151
  • [34] Solving Multi-Document Summarization as an Orienteering Problem
    Al-Saleh, Asma
    Menai, Mohamed El Bachir
    ALGORITHMS, 2018, 11 (07)
  • [35] Mining Both Commonality and Specificity From Multiple Documents for Multi-Document Summarization
    Ma, Bing
    IEEE ACCESS, 2024, 12 : 54371 - 54381
  • [36] Abstractive Multi-Document Summarization Based on Semantic Link Network
    Li, Wei
    Zhuge, Hai
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (01) : 43 - 54
  • [37] Research on multi-document summarization merging the sentential semantic features
    Luo S.-L.
    Bai J.-M.
    Pan L.-M.
    Han L.
    Meng Q.
    2016, Beijing Institute of Technology (36): : 1059 - 1064
  • [38] Exploring events and distributed representations of text in multi-document summarization
    Marujo, Luis
    Ling, Wang
    Ribeiro, Ricardo
    Gershman, Anatole
    Carbonell, Jaime
    de Matos, David Martins
    Neto, Joao P.
    KNOWLEDGE-BASED SYSTEMS, 2016, 94 : 33 - 42
  • [39] Automatic Multi-Document Summarization for Indonesian Documents Using Hybrid Abstractive-Extractive Summarization Technique
    Yapinus, Glorian
    Erwin, Alva
    Galinium, Maulahikmah
    Muliady, Wahyu
    2014 6TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND ELECTRICAL ENGINEERING (ICITEE), 2014, : 39 - 43
  • [40] A preference learning approach to sentence ordering for multi-document summarization
    Bollegala, Danushka
    Okazaki, Naoaki
    Ishizuka, Mitsuru
    INFORMATION SCIENCES, 2012, 217 : 78 - 95