GSPSummary: A graph-based sub-topic partition algorithm for summarization

Cited by: 0
Authors
Zhang, Jin [1]
Cheng, Xueqi [1]
Xu, Hongbo [1]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
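Source
INFORMATION RETRIEVAL TECHNOLOGY | 2008 / Vol. 4993
Keywords
multi-document summarization; sub-topic; graph representation
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract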
Multi-document summarization (MDS) is a challenging research topic in natural language processing. To obtain an effective summary, this paper presents a novel extractive approach based on a graph-based sub-topic partition algorithm (GSPSummary). In particular, a sub-topic model based on graph representation is presented, with emphasis on the implicit logical structure of the topic covered in the document collection. A new MDS framework with sub-topic partition is then proposed. Furthermore, a novel scalable ranking criterion is adopted that integrates both word-based features and global features. Experimental results on DUC2005 show that the proposed approach can significantly outperform the top-performing systems in the DUC tasks.
Pages: 321-334
Page count: 14