Comparative Evaluation of Term-Weighting Methods for Automatic Summarization

被引：9

作者：

Orasan, Constantin ^{[1
]}

机构：

[1] Wolverhampton Univ, Res Grp Computat Linguist, Wolverhampton WV1 1SB, England

来源：

JOURNAL OF QUANTITATIVE LINGUISTICS | 2009年 / 16卷 / 01期

关键词：

RELEVANCE;

D O I：

10.1080/09296170802514187

中图分类号：

H0 [语言学];

学科分类号：

030303 ; 0501 ; 050102 ;

摘要：

Term-based summarization assumes that it is possible to determine the importance of a sentence on the basis of the words it contains. To achieve this, words are weighted using term-weighting measures which in turn are used to weight the sentences. This article presents a comparative evaluation of summaries produced using different term-weighting measures and different combinations of parameters which are used to calculate these measures. Comparative evaluation of summaries produced reveals that in many cases simple methods such as term frequency can produce informative summaries.

引用

页码：67 / 95

页数：29

共 40 条

[1]

[Anonymous], 1997, 5 C APPL NAT LANG PR, DOI DOI 10.3115/974557.974599

[2]

[Anonymous], P 16 ANN INT ACM SIG

[3]

[Anonymous], 2001, Automatic Summarization

[4]

[Anonymous], 2003, How much information

[5]

[Anonymous], 1992, Information retrieval: Data structures and algorithms

[6]

[Anonymous], 1997, EUROPEAN ASS ARCHAEO

[7] MACHINE-MADE INDEX FOR TECHNICAL LITERATURE - AN EXPERIMENT [J].

BAXENDALE, PB .

IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1958, 2 (04) :354-361

[8]

Black W. J., 1988, Expert Systems for Information Management, V1, P159

[9] AUTOMATIC CONDENSATION OF ELECTRONIC PUBLICATIONS BY SENTENCE SELECTION [J].

BRANDOW, R ;

MITZE, K ;

RAU, LF .

INFORMATION PROCESSING & MANAGEMENT, 1995, 31 (05) :675-685

[10]

BUCKLEY C, 1985, 85686 CORN U

← 1 2 3 4 →