Generating hierarchical document indices from common denominators in large document collections

被引:1
作者
OKane, KC
机构
[1] Department of Computer Science, University of Northern Iowa, Cedar Falls
关键词
D O I
10.1016/0306-4573(95)00032-C
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes an effective, simple and efficient algorithm for computer generation of hierarchical indices from Document-Term matrices by means of calculating common denominator vectors from the document vector set. This procedure produces an intuitive, user-friendly hierarchical index of a document collection not unlike that which would be expected had a manual indexer set about to create an index or outline of a collection. The resulting index, when presented with a graphical user interface, provides the user with a natural easily comprehended view of the document collection that permits general browsing and informal search activities with an access method that requires no keyboard entry or prior knowledge of the vocabulary.
引用
收藏
页码:105 / 115
页数:11
相关论文
共 6 条
[1]   DYNAMIC CLUSTER MAINTENANCE [J].
CAN, F ;
OZKARAHAN, EA .
INFORMATION PROCESSING & MANAGEMENT, 1989, 25 (03) :275-291
[2]   AN ANALYSIS OF APPROXIMATE VERSUS EXACT DISCRIMINATION VALUES [J].
CROUCH, CJ .
INFORMATION PROCESSING & MANAGEMENT, 1988, 24 (01) :5-16
[3]   COMPARISON OF HIERARCHIC AGGLOMERATIVE CLUSTERING METHODS FOR DOCUMENT-RETRIEVAL [J].
ELHAMDOUCHI, A ;
WILLETT, P .
COMPUTER JOURNAL, 1989, 32 (03) :220-227
[4]   THE STATE OF RETRIEVAL-SYSTEM EVALUATION [J].
SALTON, G .
INFORMATION PROCESSING & MANAGEMENT, 1992, 28 (04) :441-449
[5]  
SALTON G, 1983, INTRO MODERN INFORMA
[6]  
Salton G., 1988, AUTOMATIC TEXT PROCE