Enhanced Genetic Algorithm for Single Document Extractive Summarization

被引:3
作者
Bui Thi Mai Anh [1 ]
Nguyen Tra My [1 ]
Nguyen Thi Thu Trang [1 ]
机构
[1] Hanoi Univ Sci & Technol, Hanoi, Vietnam
来源
SOICT 2019: PROCEEDINGS OF THE TENTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY | 2019年
关键词
Sentence Extraction; Extractive Summarization; Genetic Algorithm;
D O I
10.1145/3368926.3369729
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In extractive summarization, summaries are generated by selecting the most salient sentences from the original text. The text summarization can be seen as a classification of sentences into two groups: in-summary/not-in-summary. Many approaches have been proposed to extract key sentences in which using Genetic Algorithms (GAs) has shown some promising results. In this paper, we propose an enhanced genetic algorithm in order to improve the quality of extractive text summarization. More concisely, we first evaluate the role of some sentence features and their contribution to improve the fitness function. We second investigate some crossover and mutation mechanisms in order to augment the accuracy of summarization as well as the performance of our model. The experiment has been conducted for the Daily Mail dataset to assess the proposed model and previous works. The empirical results show that our proposed GA gives better accuracy in comparison with TextRank and SummaRunNer, i.e., increasing the accuracy by 7.2% and 6.9% respectively.
引用
收藏
页码:370 / 376
页数:7
相关论文
共 23 条
[1]  
[Anonymous], 2015, NIPS
[2]  
[Anonymous], 2006, GENETIC ALGORITHMS
[3]  
[Anonymous], 2015, ARXIV150900685, DOI DOI 10.18653/V1/D15-1044
[4]  
Bajpai Pratibha, 2010, INDIAN J COMPUTER SC, V1, P3
[5]  
BUCKLAND M, 1994, J AM SOC INFORM SCI, V45, P12, DOI 10.1002/(SICI)1097-4571(199401)45:1<12::AID-ASI2>3.0.CO
[6]  
2-L
[7]  
Chatterjee N, 2012, PROC INT CONF EMERG, P19, DOI 10.1109/EAIT.2012.6407852
[8]   LexRank: Graph-based lexical centrality as salience in text summarization [J].
Erkan, G ;
Radev, DR .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2004, 22 :457-479
[9]  
Garcia-Hernandez Rene Arnulfo, 2013, Pattern Recognition. 5th Mexican Conference, MCPR 2013. Proceedings: LNCS 7914, P374, DOI 10.1007/978-3-642-38989-4_38
[10]  
Kalyanmoy D., 1998, P INT WORKSHOP SOFT, P58