AN OPTIMIZATION APPROACH TO AUTOMATIC GENERIC DOCUMENT SUMMARIZATION

被引:10
作者
Alguliev, Rasim M. [1 ]
Aliguliyev, Ramiz M. [1 ]
Mehdiyev, Chingiz A. [1 ]
机构
[1] Azerbaijan Natl Acad Sci, Inst Informat Technol, Baku 1141, Az, Azerbaijan
关键词
generic document summarization; summary diversity; redundancy; optimization models; PSO with nonlinear decreasing inertia weight; PMI-based sentence similarity measure; PARTICLE SWARM; RANKING; LEXRANK;
D O I
10.1111/j.1467-8640.2012.00437.x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we have presented an optimization approach to document summarization. The potential of optimization based document summarization models has not been well explored to date. This is partially the difficulty to formulate the criteria used for objective assessment. We modeled document summarization as the linear and nonlinear optimization problems. These models generally attempt simultaneously to balance coverage and diversity in the summary. To solve the optimization problem we developed a novel particle swarm optimization (PSO) algorithm. Experiments showed our linear and nonlinear models produce very competitive results, which significantly outperform the NIST baselines in both years. More important, although linear and nonlinear models are comparable to the top three systems S24, S15, and S12 in the DUC2006, they are even superior to the best participating system in the DUC2005.
引用
收藏
页码:129 / 155
页数:27
相关论文
共 49 条
[1]  
Achananuparp Palakorn, 2010, Proceedings 2010 IEEE/ACM International Conference on Web Intelligence-Intelligent Agent Technology (WI-IAT), P342, DOI 10.1109/WI-IAT.2010.36
[2]   System identification and control using adaptive particle swarm optimization [J].
Alfi, Alireza ;
Modares, Hamidreza .
APPLIED MATHEMATICAL MODELLING, 2011, 35 (03) :1210-1221
[3]  
Alguliev Rasim, 2009, Intelligent Information Management, V1, P128, DOI 10.4236/iim.2009.12019
[4]   Automatic Text Documents Summarization through Sentences Clustering [J].
Alguliev, R. M. ;
Alyguliev, R. M. .
JOURNAL OF AUTOMATION AND INFORMATION SCIENCES, 2008, 40 (09) :53-63
[5]  
Alguliev Rasim, 2010, INTELLIGENT CONTROL, V1, P105
[6]   MCMR: Maximum coverage and minimum redundant text summarization model [J].
Alguliev, Rasim M. ;
Aliguliyev, Ramiz M. ;
Hajirahimova, Makrufa S. ;
Mehdiyev, Chingiz A. .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (12) :14514-14522
[7]   CLUSTERING TECHNIQUES AND DISCRETE PARTICLE SWARM OPTIMIZATION ALGORITHM FOR MULTI-DOCUMENT SUMMARIZATION [J].
Aliguliyev, Ramiz M. .
COMPUTATIONAL INTELLIGENCE, 2010, 26 (04) :420-448
[8]   A new sentence similarity measure and sentence based extractive technique for automatic text summarization [J].
Aliguliyev, Ramiz M. .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) :7764-7772
[9]   Incorporating Prior Knowledge into a Transductive Ranking Algorithm for Multi-Document Summarization [J].
Amini, Massih-Reza ;
Usunier, Nicolas .
PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, :704-705
[10]  
[Anonymous], 2008, P 17 ACM C INF KNOWL, DOI DOI 10.1145/1458082.1458319