IMPLEMENTING RANKING STRATEGIES USING TEXT SIGNATURES

被引:32
作者
CROFT, WB [1 ]
SAVINO, P [1 ]
机构
[1] C OLIVETTI & CO,I-56100 PISA,ITALY
来源
ACM TRANSACTIONS ON OFFICE INFORMATION SYSTEMS | 1988年 / 6卷 / 01期
关键词
INFORMATION RETRIEVAL SYSTEMS - Online Searching - LIBRARIES - Automation - MANAGEMENT - Information Systems - OFFICE AUTOMATION;
D O I
10.1145/42279.45947
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Signature files provide an efficient access method for text in documents, but retrieval is usually limited to finding documents that contain a specified Boolean pattern of word. Effective retrieval requires that documents with similar meanings be found through a process of plausible inference. The simplest way of implementing this retrieval process is to rank documents in order of their probability of relevance. In this paper techniques are described for implementing probabilistic ranking strategies with sequential and bit-sliced signature files and the limitations of these implementations with regard to their effectiveness are pointed out. A detail comparison is made between signature based ranking techniques and ranking using term-based document representatives and inverted files. The comparison shows that term-based representations are at least competive (in terms of efficiency) with signature files and, in some situations, superior.
引用
收藏
页码:42 / 62
页数:21
相关论文
共 33 条