Information Filtering and Query Indexing for an Information Retrieval Model

被引:11
作者
Tryfonopoulos, Christos [1 ]
Koubarakis, Manolis [2 ]
Drougas, Yannis [3 ]
机构
[1] Max Planck Inst Informat, Databases & Informat Syst Dept, D-66123 Saarbrucken, Germany
[2] Natl & Kapodistrian Univ Athens, Dept Informat & Telecommunicat, Athens 15784, Greece
[3] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA
关键词
Algorithms; Performance; Information filtering; selective dissemination of information; query indexing algorithms; performance evaluation; TRIE; DISSEMINATION; COMPLEXITY; DOCUMENTS; SYSTEMS;
D O I
10.1145/1462198.1462202
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the information filtering paradigm, clients subscribe to a server with continuous queries or profiles that express their information needs. Clients can also publish documents to servers. Whenever a document is published, the continuous queries satisfying this document are found and notifications are sent to appropriate clients. This article deals with the filtering problem that needs to be solved efficiently by each server: Given a database of continuous queries db and a document d, find all queries q epsilon db that match d. We present data structures and indexing algorithms that enable us to solve the filtering problem efficiently for large databases of queries expressed in the model AWP. AWP is based on named attributes with values of type text, and its query language includes Boolean and word proximity operators.
引用
收藏
页数:47
相关论文
共 96 条
[61]  
Milios E., 2003, P 6 C PAC ASS COMP L, P275
[62]  
Morita M., 1994, P 17 ANN INT ACM SIG, P272
[63]   Proximal nodes: A model to query document databases by content and structure [J].
Navarro, G ;
BaezaYates, R .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 1997, 15 (04) :400-435
[64]  
NGUYEN B, 2001, P ACM SIGMOD C SANT
[65]   IP-address lookup using LC-tries [J].
Nilsson, S ;
Karlsson, G .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 1999, 17 (06) :1083-1092
[66]   COMPUTER-PROGRAMS FOR DETECTING AND CORRECTING SPELLING-ERRORS [J].
PETERSON, JL .
COMMUNICATIONS OF THE ACM, 1980, 23 (12) :676-687
[67]   SEARCHING STRUCTURED DOCUMENTS WITH THE ENHANCED RETRIEVAL FUNCTIONALITY OF FREE WAIS-SF AND SFGATE [J].
PFEIFER, U ;
FUHR, N ;
HUYNH, T .
COMPUTER NETWORKS AND ISDN SYSTEMS, 1995, 27 (06) :1027-1036
[68]  
PIETZUCH P, 2002, P 1 INT WORKSH DISTR
[69]  
RAFTOPOULOU P, 2008, P 12 EUR C RES ADV T
[70]  
Ratnasamy S., 2001, P ACM SIGCOMM C