Two uses of anaphora resolution in summarization

被引:86
作者
Steinberger, Josef
Poesio, Massimo
Kabadjov, Mijail A.
Jezek, Karel
机构
[1] Univ W Bohemia, Plzen 30614, Czech Republic
[2] Univ Essex, Colchester CO4 3SQ, Essex, England
[3] Univ Trent, I-38100 Rovereto, TN, Italy
基金
英国工程与自然科学研究理事会;
关键词
summarization; latent semantic analysis; singular value decomposition; anaphora resolution;
D O I
10.1016/j.ipm.2007.01.010
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a new method for using anaphoric information in Latent Semantic Analysis (LSA), and discuss its application to develop an LSA-based summarizer which achieves a significantly better performance than a system not using anaphoric information, and a better performance by the ROUGE measure than all but one of the single-document summarizers participating in DUC-2002. Anaphoric information is automatically extracted using a new release of our own anaphora resolution system, GUITAR, which incorporates proper noun resolution. Our summarizer also includes a new approach for automatically identifying the dimensionality reduction of a document on the basis of the desired summarization percentage. Anaphoric information is also used to check the coherence of the summary produced by our summarizer, by a reference checker module which identifies anaphoric resolution errors caused by sentence extraction. (C) 2007 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1663 / 1680
页数:18
相关论文
共 28 条
[1]  
BALDWIN B, 1998, P EMNLP GRAN SPAIN
[2]  
Barzilay R., 1997, P ACL EACL WORKSH IN
[3]  
BERGLER S, 2003, P DUC EDM CAN
[4]   Using linear algebra for intelligent information retrieval [J].
Berry, MW ;
Dumais, ST ;
OBrien, GW .
SIAM REVIEW, 1995, 37 (04) :573-595
[5]  
Boguraev B., 1999, ADV AUTOMATIC TEXT S
[6]  
BONTCHEVA K, 2002, CHAM REF RES AN WORK
[7]  
CHARNIAK E, 2000, P NAACL PHIL US
[8]  
CHOI FYY, 2001, P EMNLP PITTSB US
[9]   A probabilistic model for Latent Semantic Indexing [J].
Ding, CHQ .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2005, 56 (06) :597-608
[10]  
HASLER L, 2003, P CORP LING UK LANC