Query-oriented text summarization based on multiobjective evolutionary algorithms and word embeddings

被引:7
作者
Fors-Isalguez, Yanet [1 ]
Hermosillo-Valadez, Jorge [1 ]
Montes-y-Gomez, Manuel [2 ]
机构
[1] Univ Autonoma Estado Morelos, Ctr Invest Ciencias IICBA, Av Univ 1001, Cuernavaca 62209, Morelos, Mexico
[2] Inst Nacl Astrofis Opt & Electr, Puebla, Mexico
关键词
Query-oriented multi-document summarization; multiobjective-optimization; sentence embedded representation;
D O I
10.3233/JIFS-169506
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic text summarization systems are nowadays of great help to extract relevant information from large corpora. Many solutions to the task have been proposed from the perspective of the optimization of a single-objective function, aiming at finding the global optimum. This is an unrealistic goal since when multiple objectives are considered a solution that optimizes one of the objectives may induce the opposite effect on the others. Recently other solutions have been proposed that involve multiple, conflicting objectives, but which eventually are aggregated into a scalar function thus resulting in a single-objective optimization problem. Furthermore, oftentimes a typical bag of words model is used and little effort has been made to include semantic relations between sentences to improve performance. In this paper a novel method for query-oriented summarization is proposed as a multiobjective optimization problem taking into account the Pareto front and based on an embedded representation of sentences. The method is evaluated with the TAC 2009 dataset. Experimental results show that the approach contributes to improve performance significantly. To the authors' knowledge, the method is the first attempt to include embedded representations of sentences in a multiobjective optimization solution, which applies the Pareto approach to query-oriented summarization.
引用
收藏
页码:3235 / 3244
页数:10
相关论文
共 28 条
[11]  
Freitas A. A., 2004, SIGKDD Explor. Newsl., V6, P77, DOI DOI 10.1145/1046456.1046467
[12]  
Gillick Daniel, 2008, TAC
[13]   Modeling Document Summarization as Multi-objective Optimization [J].
Huang, Lei ;
He, Yanxiang ;
Wei, Furu ;
Li, Wenjie .
2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, :382-386
[14]  
Kumar V. S. Raj, 2016, ELK ASIA PACIFIC J C, V2, P31
[15]  
Levy O., 2015, Transactions of the Association for Computational Linguistics, V3, P211, DOI [DOI 10.1162/TACL_A_00134, DOI 10.1162/TACLA00134]
[16]  
Lin C.-Y., 2004, Proceedings of the workshop on text summarization branches out, P74
[17]  
Lin CY, 2003, HLT-NAACL 2003: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, P150
[18]  
Mashreghi Z., 2017, INT J COMPUTER APPL, V159, P38
[19]  
McDonald R, 2007, LECT NOTES COMPUT SC, V4425, P557
[20]  
Mikolov T., 2013, P 2013 C N AM CHAPT