Parallelizing a multi-objective optimization approach for extractive multi-document text summarization

被引:11
作者
Sanchez-Gomez, Jesus M. [1 ]
Vega-Rodriguez, Miguel A. [1 ]
Perez, Carlos J. [2 ]
机构
[1] Univ Extremadura, Dept Comp & Commun Technol, Campus Univ S-N, Caceres 10003, Spain
[2] Univ Extremadura, Dept Math, Campus Univ S-N, Caceres 10003, Spain
关键词
Parallel computing; Multi-document; Text summarization; Multi-objective optimization; Artificial bee colony; MAXIMUM COVERAGE; ALGORITHM; SELECTION;
D O I
10.1016/j.jpdc.2019.09.001
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Currently, automatic multi-document text summarization is an important task in many fields of knowledge, due to the continuous exponential growth of information on the Internet. Nevertheless, this task is computationally demanding. In the last years, automatic text summarization has been addressed by using multi-objective optimization approaches. In particular, recently, the Multi-Objective Artificial Bee Colony (MOABC) algorithm has obtained very good results. This work focuses on the parallelization of this approach. Several steps have been carried out for this goal. After a time profiling of the algorithm, a runtime comparison has been performed between the use of different random number generators within the algorithm. Then, a parallel implementation of the MOABC algorithm has been designed following its original scheme, in which the main steps are parallelized, and different parallel schedules have been studied and compared. Finally, a second design based on the asynchronous behavior of the bee colony in nature has been implemented and compared. Experiments have been carried out with datasets from Document Understanding Conference (DUC). The results show that the asynchronous design improves greatly the parallel design, being more than 55 times faster with 64 threads than the standard design. An efficiency of 86.72% has been reported for 64 threads. (C) 2019 Elsevier Inc. All rights reserved.
引用
收藏
页码:166 / 179
页数:14
相关论文
共 34 条
[1]  
Alguliev R. M., 2012, Problems of Information Technology, P20
[2]  
Alguliev R. M., 2011, APPL COMPUTATIONAL I, V2011, P1
[3]   AN OPTIMIZATION APPROACH TO AUTOMATIC GENERIC DOCUMENT SUMMARIZATION [J].
Alguliev, Rasim M. ;
Aliguliyev, Ramiz M. ;
Mehdiyev, Chingiz A. .
COMPUTATIONAL INTELLIGENCE, 2013, 29 (01) :129-155
[4]   Formulation of document summarization as a 0-1 nonlinear programming problem [J].
Alguliev, Rasim M. ;
Aliguliyev, Ramiz M. ;
Isazade, Nijat R. .
COMPUTERS & INDUSTRIAL ENGINEERING, 2013, 64 (01) :94-102
[5]   Multiple documents summarization based on evolutionary optimization algorithm [J].
Alguliev, Rasim M. ;
Aliguliyev, Ramiz M. ;
Isazade, Nijat R. .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (05) :1675-1689
[6]   Sentence selection for generic document summarization using an adaptive differential evolution algorithm [J].
Alguliev, Rasim M. ;
Aliguliyev, Ramiz M. ;
Mehdiyev, Chingiz A. .
SWARM AND EVOLUTIONARY COMPUTATION, 2011, 1 (04) :213-222
[7]   GenDocSum plus MCLR: Generic document summarization based on maximum coverage and less redundancy [J].
Alguliev, Rasim M. ;
Aliguliyev, Ramiz M. ;
Hajirahimova, Makrufa S. .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (16) :12460-12473
[8]   MCMR: Maximum coverage and minimum redundant text summarization model [J].
Alguliev, Rasim M. ;
Aliguliyev, Ramiz M. ;
Hajirahimova, Makrufa S. ;
Mehdiyev, Chingiz A. .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (12) :14514-14522
[9]   An unsupervised approach to generating generic summaries of documents [J].
Alguliyev, Rasim M. ;
Aliguliyev, Ramiz M. ;
Isazade, Nijat R. .
APPLIED SOFT COMPUTING, 2015, 34 :236-250
[10]  
[Anonymous], 2008, USING OPENMP PORTABL