Multi-Document Summarization Using Sentence Clustering

被引:0
作者
Gupta, Virendra Kumar [1 ]
Siddiqui, Tanveer J. [2 ]
机构
[1] Samsung India Software Operat, Bangalore, Karnataka, India
[2] Univ Allahabad, Dept Elect & Commun, Allahabad 211002, Uttar Pradesh, India
来源
4TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2012) | 2012年
关键词
Multi document summarization; sentence clustering method; feature extraction; DUC-2002;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents an approach to query focused multi document summarization by combining single document summary using sentence clustering. Both syntactic and semantic similarity between sentences is used for clustering. Single document summary is generated using document feature, sentence reference index feature, location feature and concept similarity feature. Sentences from single document summaries are clustered and top most sentences from each cluster are used for creating multi-document summary. We observed an average F-measure of 0.33774 on DUC 2002 multi-document dataset, which is comparable to three best performing systems reported on the same dataset.
引用
收藏
页数:5
相关论文
共 10 条
[1]  
Barzilay R., 1997, Intelligent Scalable Text Summarization. Proceedings of a Workshop, P10
[2]  
Harabagiu S. M., 2002, DOCUMENTUNDERSTANDIN
[3]   Sentence similarity based on semantic nets and corpus statistics [J].
Li, Yuhua ;
McLean, David ;
Bandar, Zuhair A. ;
O'Shea, James D. ;
Crockett, Keeley .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (08) :1138-1150
[4]   Measuring semantic similarity within sentences [J].
Liu, Xiao-Ying ;
Zhou, Yi-Ming ;
Zheng, Ruo-Shi .
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, :2558-+
[5]  
McKeown K., 1995, SIGIR Forum, P74
[6]   WORDNET - A LEXICAL DATABASE FOR ENGLISH [J].
MILLER, GA .
COMMUNICATIONS OF THE ACM, 1995, 38 (11) :39-41
[7]  
Patel Alkesh, 2007, P RIAO 2007 MAY 30 J
[8]   Centroid-based summarization of multiple documents [J].
Radev, DR ;
Jing, HY ;
Stys, M ;
Tam, D .
INFORMATION PROCESSING & MANAGEMENT, 2004, 40 (06) :919-938
[9]  
Silber H. Gregory, 2002, COMPUTATIONAL LINGUI, V28, P487
[10]   Using query expansion in graph-based approach for query-focused multi-document summarization [J].
Zhao, Lin ;
Wu, Lide ;
Huang, Xuanjing .
INFORMATION PROCESSING & MANAGEMENT, 2009, 45 (01) :35-41