Comparative Evaluation of Term-Weighting Methods for Automatic Summarization

Cited by: 9
Authors
Orasan, Constantin [1]
Affiliations
[1] Wolverhampton Univ, Res Grp Computat Linguist, Wolverhampton WV1 1SB, England
Keywords
RELEVANCE
DOI
10.1080/09296170802514187
Chinese Library Classification
H0 [Linguistics]
Subject classification codes
030303; 0501; 050102
Abstract
Term-based summarization assumes that the importance of a sentence can be determined from the words it contains. To achieve this, words are weighted using term-weighting measures, and these word weights are in turn used to weight the sentences. This article presents a comparative evaluation of summaries produced using different term-weighting measures and different combinations of the parameters used to calculate these measures. The evaluation reveals that in many cases simple methods such as term frequency can produce informative summaries.
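To make the idea in the abstract concrete, the following is a minimal sketch of term-based extractive summarization using raw term frequency as the weighting measure. It is not the article's implementation: the function names (`split_sentences`, `tokenize`, `summarize`), the small stopword list, the regex-based sentence splitter, and the choice to score a sentence as the plain sum of its word weights are all assumptions made for illustration.

```python
import re
from collections import Counter

# Assumption: a tiny illustrative stopword list, not the one used in the article.
STOPWORDS = {"a", "an", "the", "of", "and", "to", "in", "is", "are", "it", "that"}


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Lowercase alphabetic tokens with stopwords removed."""
    return [w for w in re.findall(r"[a-z]+", sentence.lower()) if w not in STOPWORDS]


def summarize(text, n_sentences=1):
    """Return the n highest-scoring sentences, in their original document order."""
    sentences = split_sentences(text)
    # Term weight = raw frequency of the word in the whole document.
    tf = Counter(w for s in sentences for w in tokenize(s))
    # Sentence weight = sum of the weights of the words it contains (assumed scoring rule).
    scores = [sum(tf[w] for w in tokenize(s)) for s in sentences]
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    chosen = sorted(ranked[:n_sentences])
    return [sentences[i] for i in chosen]


if __name__ == "__main__":
    doc = "Cats chase mice. Cats like cats. Dogs bark."
    print(summarize(doc, n_sentences=1))
```

Summing weights favors longer sentences; swapping in another term-weighting measure or normalizing by sentence length only requires changing how `tf` and `scores` are computed.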
Pages: 67-95
Number of pages: 29