TB±tree: Index Structure for Information Retrieval Systems

被引:0
作者
Fekihal, Mabruk [1 ]
Jaluta, Ibrahim [2 ]
Saini, Dinesh Kumar [1 ]
机构
[1] Sohar Univ, Fac Comp & IT, Sohar, Oman
[2] Univ Tripoli, Dept Comp Sci, Tripoli, Libya
来源
2015 SECOND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE, COMPUTER ENGINEERING, AND SOCIAL MEDIA (CSCESM) | 2015年
关键词
Information Retrieval Systems; Inverted files; single key-word and phrase queries; Indexing; B +/- tree; TB +/- tree;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Information Retrieval Systems (IR) is using different indexing techniques to retrieve information such as, Inverted files, and Signature files. However, Signature files are suitable for small IR systems due to its slow response, while inverted file have better response time but its space overhead is high. Moreover, inverted files use B +/- trees for single-word queries. In this paper, a new indexing structure called TB +/- tree to be used in the design of inverted files for large information retrieval systems. The TB +/- tree is a variant of the B +/- tree that supports single key-word queries and phrase queries efficiently. In TB +/- tree algorithms which represent each key-word stored in the index by a numeric value, and this numeric value can be used as encryption and inforce security. The numeric value for each keyword is stored in binary format, which may reduce the size of the index file by 19%. The records in TB +/- tree may be of variable length.
引用
收藏
页码:182 / 186
页数:5
相关论文
共 15 条
[1]  
Alistair M., 1998, ACM T DATABASE SYST, V23, P453
[2]  
[Anonymous], 2008, Introduction to information retrieval
[3]  
[Anonymous], 1999, Compressing and Indexing Documents and Images
[4]  
[Anonymous], 2011, Modern Information Retrieval: The Concepts and Technology behind Search
[5]   UBIQUITOUS B-TREE [J].
COMER, D .
COMPUTING SURVEYS, 1979, 11 (02) :121-137
[6]  
Culpepper JS, 2010, LECT NOTES COMPUT SC, V6347, P194, DOI 10.1007/978-3-642-15781-3_17
[7]  
Frankes B., 1992, INFORM RETRIEVAL DAT
[8]  
Gray Jim, 1993, T PROCESSING CONCEPT
[9]  
Guttman Antonin., 1984, P 1984 ACM SIGMOD C, P47
[10]   Concurrency control and recovery for balanced B-link trees [J].
Jaluta, I ;
Sippu, S ;
Soisalon-Soininen, E .
VLDB JOURNAL, 2005, 14 (02) :257-277