KeyGraph: Automatic indexing by co-occurrence graph based on building construction metaphor

被引:237
作者
Ohsawa, Y [1 ]
Benson, NE [1 ]
Yachida, M [1 ]
机构
[1] Osaka Univ, Grad Sch Engn Sci, Dept Syst & Human Sci, Osaka 5608531, Japan
来源
IEEE INTERNATIONAL FORUM ON RESEARCH AND TECHNOLOGY ADVANCES IN DIGITAL LIBRARIES -ADL'98-, PROCEEDINGS | 1998年
关键词
D O I
10.1109/ADL.1998.670375
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present an algorithm for extracting keywords representing the asserted main point in a document, without relying on external devices such as natural language processing tools or a document corpus. Our algorithm KeyGraph is based on the segmentation of a graph, representing the co-occurrence between terms in a document, into clusters. Each cluster corresponds to a concept on which author's idea is based, and top ranked terms by a statistic based on each term's relationship to these clusters are selected as keywords. This strategy comes from considering that a document is constructed like a building for expressing new ideas based on traditional concepts. The experimental results show that thus extracted terms match author's point quite accurately, even though KeyGraph does not use each term's average frequency in a corpus, i.e., KeyGraph is a content-sensitive, domain independent device of indexing.
引用
收藏
页码:12 / 18
页数:7
相关论文
empty
未找到相关数据