Improved self-indexing inverted files for full-text retrieval

被引:0
作者
College of Compute Science, South-Central University for Nationalities, Wuhan 430074, China [1 ]
不详 [2 ]
机构
来源
J. Comput. Inf. Syst. | 2009年 / 2卷 / 1017-1024期
关键词
Indexing (of information) - Information retrieval;
D O I
暂无
中图分类号
学科分类号
摘要
Self-index is a promising way to improve retrieval time-and-space efficiency by compression index files. An improved inverted file self-index called IFSI is proposed for full-text information retrieval. IFSI includes two level indexes: the first level index which contains a subset of the documents that are likely to be returned as top results; and the second level index which includes the surplus documents. IFSI can create a skipped index on each compressed posting list with very little or no storage overhead with efficient coding scheme. IFSI also supports efficient incremental updates with allocating free space efficiently at the tail of post lists based on statistics-based approach. Detailed simulation results and comparison with other schemes prove that the proposed IFSI can not only greatly reduce decompress time, but also simultaneously allow extremely fast query processing. © 2009 Binary Information Press March, 2009.
引用
收藏
相关论文
empty
未找到相关数据