Inferring strategies for sentence ordering in multidocument news summarization

Cited by: 129
Authors
Barzilay, R [1]
Elhadad, N [1]
McKeown, KR [1]
Affiliation
[1] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
Keywords
Algorithms - Approximation theory - Constraint theory - Information analysis - Strategic planning - Text processing
DOI
10.1613/jair.991
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The problem of organizing information for multidocument summarization so that the generated summary is coherent has received relatively little attention. While sentence ordering for single document summarization can be determined from the ordering of sentences in the input article, this is not the case for multidocument summarization where summary sentences may be drawn from different input articles. In this paper, we propose a methodology for studying the properties of ordering information in the news genre and describe experiments done on a corpus of multiple acceptable orderings we developed for the task. Based on these experiments, we implemented a strategy for ordering information that combines constraints from chronological order of events and topical relatedness. Evaluation of our augmented algorithm shows a significant improvement of the ordering over two baseline strategies.
Pages: 35-55
Page count: 21