IMPLEMENTING RANKING STRATEGIES USING TEXT SIGNATURES
被引:32
作者:
CROFT, WB
论文数: 0引用数: 0
h-index: 0
机构:
C OLIVETTI & CO,I-56100 PISA,ITALYC OLIVETTI & CO,I-56100 PISA,ITALY
CROFT, WB
[1
]
SAVINO, P
论文数: 0引用数: 0
h-index: 0
机构:
C OLIVETTI & CO,I-56100 PISA,ITALYC OLIVETTI & CO,I-56100 PISA,ITALY
SAVINO, P
[1
]
机构:
[1] C OLIVETTI & CO,I-56100 PISA,ITALY
来源:
ACM TRANSACTIONS ON OFFICE INFORMATION SYSTEMS
|
1988年
/
6卷
/
01期
关键词:
INFORMATION RETRIEVAL SYSTEMS - Online Searching - LIBRARIES - Automation - MANAGEMENT - Information Systems - OFFICE AUTOMATION;
D O I:
10.1145/42279.45947
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Signature files provide an efficient access method for text in documents, but retrieval is usually limited to finding documents that contain a specified Boolean pattern of word. Effective retrieval requires that documents with similar meanings be found through a process of plausible inference. The simplest way of implementing this retrieval process is to rank documents in order of their probability of relevance. In this paper techniques are described for implementing probabilistic ranking strategies with sequential and bit-sliced signature files and the limitations of these implementations with regard to their effectiveness are pointed out. A detail comparison is made between signature based ranking techniques and ranking using term-based document representatives and inverted files. The comparison shows that term-based representations are at least competive (in terms of efficiency) with signature files and, in some situations, superior.