AUTOMATIC TEXT SUMMARIZATION USING SUPPORT VECTOR MACHINE

被引:0
作者
Begum, Nadira [1 ]
Fattah, Mohamed Abdel [1 ,2 ]
Ren, Fuji [1 ,3 ]
机构
[1] Univ Tokushima, Fac Engn, Tokushima 7708506, Japan
[2] Helwan Univ, FIE, Cairo, Egypt
[3] Beijing Univ Posts & Telecommun, Sch Informat Engn, Beijing 100088, Peoples R China
来源
INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL | 2009年 / 5卷 / 07期
基金
日本学术振兴会;
关键词
Automatic summarization; Support vector machine; Text features; SENTENCE COMPRESSION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work investigates different text features to select the best one and proposes an approach to address automatic text summarization. This approach is a trainable summarizer, which takes into account several features, including sentence position, sentence centrality, sentence resemblance to the title, sentence inclusion of name entity, sentence inclusion of numerical data, sentence relative length, Bushy path of the sentence and aggregated similarity for each sentence to generate summaries. First we investigate the effect of each sentence feature on the summarization task. Then we use all features score function to train Support Vector Machine (SVM) in order to construct a text summarizer model. The proposed approach performance is measured at several compression rates (CR) on a data corpus composed of 100 English articles from the domain of politics.
引用
收藏
页码:1987 / 1996
页数:10
相关论文
共 22 条
[1]  
Brown M., 2000, IEEE T GEOSCIENCE RE, V38
[2]  
Chen RC, 2008, INT J INNOV COMPUT I, V4, P413
[3]   A relative evaluation of multiclass image classification by support vector machines [J].
Foody, GM ;
Mathur, A .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2004, 42 (06) :1335-1343
[4]   Satisfying information needs with multi-document summaries [J].
Harabagiu, Sanda ;
Hickl, Andrew ;
Lacatusu, Finley .
INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (06) :1619-1642
[5]   Supervised automatic evaluation for summarization with voted regression model [J].
Hirao, Tsutomu ;
Okumura, Manabu ;
Yasuda, Norihito ;
Isozaki, Hideki .
INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (06) :1521-1535
[6]   Task-based evaluation of text summarization using relevance prediction [J].
Hobson, Stacy President ;
Dorr, Bonnie J. ;
Monz, Christof ;
Schwartz, Richard .
INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (06) :1482-1499
[7]   Generating gene summaries from biomedical literature: A study of semi-structured summarization [J].
Ling, Xu ;
Jiang, Jing ;
He, Xin ;
Mei, Qiaozhu ;
Zhai, Chengxiang ;
Schatz, Bruce .
INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (06) :1777-1791
[8]   Summarizing court decisions [J].
Moens, Marie-Francine .
INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (06) :1748-1764
[9]   Nonlinear prediction of chaotic time series using support vector machines [J].
Mukherjee, S ;
Osuna, E ;
Girosi, F .
NEURAL NETWORKS FOR SIGNAL PROCESSING VII, 1997, :511-520
[10]   Discriminative sentence compression with conditional random fields [J].
Nomoto, Tadashi .
INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (06) :1571-1587