Query-oriented text summarization based on multiobjective evolutionary algorithms and word embeddings

被引:7
作者
Fors-Isalguez, Yanet [1 ]
Hermosillo-Valadez, Jorge [1 ]
Montes-y-Gomez, Manuel [2 ]
机构
[1] Univ Autonoma Estado Morelos, Ctr Invest Ciencias IICBA, Av Univ 1001, Cuernavaca 62209, Morelos, Mexico
[2] Inst Nacl Astrofis Opt & Electr, Puebla, Mexico
关键词
Query-oriented multi-document summarization; multiobjective-optimization; sentence embedded representation;
D O I
10.3233/JIFS-169506
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic text summarization systems are nowadays of great help to extract relevant information from large corpora. Many solutions to the task have been proposed from the perspective of the optimization of a single-objective function, aiming at finding the global optimum. This is an unrealistic goal since when multiple objectives are considered a solution that optimizes one of the objectives may induce the opposite effect on the others. Recently other solutions have been proposed that involve multiple, conflicting objectives, but which eventually are aggregated into a scalar function thus resulting in a single-objective optimization problem. Furthermore, oftentimes a typical bag of words model is used and little effort has been made to include semantic relations between sentences to improve performance. In this paper a novel method for query-oriented summarization is proposed as a multiobjective optimization problem taking into account the Pareto front and based on an embedded representation of sentences. The method is evaluated with the TAC 2009 dataset. Experimental results show that the approach contributes to improve performance significantly. To the authors' knowledge, the method is the first attempt to include embedded representations of sentences in a multiobjective optimization solution, which applies the Pareto approach to query-oriented summarization.
引用
收藏
页码:3235 / 3244
页数:10
相关论文
共 28 条
[1]   Multiple documents summarization based on evolutionary optimization algorithm [J].
Alguliev, Rasim M. ;
Aliguliyev, Ramiz M. ;
Isazade, Nijat R. .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (05) :1675-1689
[2]   An unsupervised approach to generating generic summaries of documents [J].
Alguliyev, Rasim M. ;
Aliguliyev, Ramiz M. ;
Isazade, Nijat R. .
APPLIED SOFT COMPUTING, 2015, 34 :236-250
[3]  
[Anonymous], 2007, EVOLUTIONARY ALGORIT
[4]   Improving Performance of Text Summarization [J].
Babar, S. A. ;
Patil, Pallavi D. .
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES, ICICT 2014, 2015, 46 :354-363
[5]  
Baroni M, 2014, PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P238
[6]   Learning from homologous queries and semantically related terms for query auto completion [J].
Cai, Fei ;
de Rijke, Maarten .
INFORMATION PROCESSING & MANAGEMENT, 2016, 52 (04) :628-643
[7]  
Carbonell J., 1998, P 21 ANN INT ACM SIG, P235
[8]  
Dang H., 2009, P 2 TEXT AN C TAC200
[9]   A fast and elitist multiobjective genetic algorithm: NSGA-II [J].
Deb, K ;
Pratap, A ;
Agarwal, S ;
Meyarivan, T .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2002, 6 (02) :182-197
[10]   LexRank: Graph-based lexical centrality as salience in text summarization [J].
Erkan, G ;
Radev, DR .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2004, 22 :457-479