MCRMR: Maximum coverage and relevancy with minimal redundancy based multi-document summarization

被引:45
作者
Verma, Pradeepika [1 ]
Om, Hari [1 ]
机构
[1] IIT ISM, Dept Comp Sci & Engn, Dhanbad, Bihar, India
关键词
Multi-document summarization; Word Mover's distance; Normalized google distance; Shark smell optimization; Coverage; Non-redundancy; INFORMATION-RETRIEVAL; TEXT; FRAMEWORK; REVIEWS; SINGLE;
D O I
10.1016/j.eswa.2018.11.022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel extraction based method for multi-document summarization that covers three important features of a good summary: coverage, non-redundancy, and relevancy. The coverage and non-redundancy features are modeled to generate a single document from the multiple documents. These features are explored by the weighted combination of word embedding and Google based similarity methods. To accommodate the relevancy feature in the system generated summaries, the text summarization task is modeled as an optimization problem, where various text features with their optimized weights are used to score the sentences to find the relevant sentences. For feature's weight optimization, we use the meta-heuristic approach, Shark Smell Optimization (SSO). The experiments are performed on six benchmark datasets (DUC04, DUC06, DUC07, TAC08, TAC11, and MultiLing13) with the co-selection and content based performance parameters. The experimental results show that the proposed approach is viable and effective for multi-document summarization. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:43 / 56
页数:14
相关论文
共 61 条
[31]   Extractive multi-document summarization using population-based multicriteria optimization [J].
John, Ansamma ;
Premjith, P. S. ;
Wilscy, M. .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 86 :385-397
[32]  
Karkaletsis V., 2011, AUTOSUMMENG MEMOG EV
[33]  
Kenter T., 2015, CIKM 15CIKM, P1411, DOI DOI 10.1145/2806416.2806475
[34]   A framework for multi-document abstractive summarization based on semantic role labelling [J].
Khan, Atif ;
Salim, Naomie ;
Kumar, Yogan Jaya .
APPLIED SOFT COMPUTING, 2015, 30 :737-747
[35]  
Kobayashi H., 2015, P 2015 C EMP METH NA
[36]  
Kogilavani A., 2010, Proceedings of the 2010 International Conference on Communication and Computational Intelligence (INCOCCI), P324
[37]  
Kusner MJ, 2015, PR MACH LEARN RES, V37, P957
[38]  
Lee G.H., 2017, PROC 8 INT JOINT C N, V2, P193
[39]   Graph Summarization Methods and Applications: A Survey [J].
Liu, Yike ;
Safavi, Tara ;
Dighe, Abhilash ;
Koutra, Danai .
ACM COMPUTING SURVEYS, 2018, 51 (03)
[40]   Towards automatic tweet generation: A comparative study from the text summarization perspective in the journalism genre [J].
Lloret, Elena ;
Palomar, Manuel .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (16) :6624-6630