ADDING COMPRESSION TO A FULL-TEXT RETRIEVAL-SYSTEM

Cited by: 53
Authors
ZOBEL, J [1 ]
MOFFAT, A [1 ]
Affiliation
[1] UNIV MELBOURNE,DEPT COMP SCI,PARKVILLE,VIC 3052,AUSTRALIA
Source
SOFTWARE-PRACTICE & EXPERIENCE | 1995 / Vol. 25 / No. 08
Keywords
FULL-TEXT RETRIEVAL; DATA COMPRESSION; TEXT COMPRESSION; HUFFMAN CODING; WORD-BASED MODEL;
DOI
10.1002/spe.4380250804
Chinese Library Classification
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
We describe the implementation of a data compression scheme as an integral and transparent layer within a full-text retrieval system. Using a semi-static word-based compression model, the space needed to store the text is under 30 per cent of the original requirement. The model is used in conjunction with canonical Huffman coding and together these two paradigms provide fast decompression. Experiments with 500 Mb of newspaper articles show that in full-text retrieval environments compression not only saves space, it can also yield faster query processing - a win-win situation.
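The abstract names two components: a semi-static word-based model and canonical Huffman coding. The sketch below is not the authors' implementation; it only illustrates the general idea under simplifying assumptions. It treats whitespace-separated tokens as the "words", makes one pass to count frequencies and a second pass to encode, and uses one common canonical-code convention (symbols sorted by code length, then alphabetically, and given consecutive code values). The function names and the bit-string representation are chosen here for illustration.

```python
import heapq
from collections import Counter


def code_lengths(freqs):
    """Compute a Huffman code length for each symbol in a frequency table."""
    if len(freqs) == 1:
        return {sym: 1 for sym in freqs}
    # Heap entries are (weight, unique tiebreak, symbols in this subtree).
    heap = [(w, i, [s]) for i, (s, w) in enumerate(sorted(freqs.items()))]
    heapq.heapify(heap)
    lengths = dict.fromkeys(freqs, 0)
    next_id = len(heap)
    while len(heap) > 1:
        w1, _, s1 = heapq.heappop(heap)
        w2, _, s2 = heapq.heappop(heap)
        # Every symbol under the merged node moves one level deeper.
        for s in s1 + s2:
            lengths[s] += 1
        heapq.heappush(heap, (w1 + w2, next_id, s1 + s2))
        next_id += 1
    return lengths


def canonical_codes(lengths):
    """Assign canonical codewords (as bit strings) from code lengths alone."""
    codes = {}
    code = 0
    prev_len = 0
    for sym, ln in sorted(lengths.items(), key=lambda kv: (kv[1], kv[0])):
        code <<= ln - prev_len          # pad with zeros when the length grows
        codes[sym] = format(code, "0{}b".format(ln))
        code += 1
        prev_len = ln
    return codes


def encode(words, codes):
    """Second pass: replace each word by its codeword."""
    return "".join(codes[w] for w in words)


def decode(bits, codes):
    """Decode a bit string by matching prefix-free codewords."""
    inverse = {c: s for s, c in codes.items()}
    out, cur = [], ""
    for b in bits:
        cur += b
        if cur in inverse:
            out.append(inverse[cur])
            cur = ""
    return out


# Semi-static use: pass 1 builds the model, pass 2 encodes with it.
text = "the cat sat on the mat the end".split()
model = canonical_codes(code_lengths(Counter(text)))
bits = encode(text, model)
restored = decode(bits, model)
```

Because canonical codewords are fully determined by the code lengths, a decoder needs only the word list and the length of each codeword to rebuild the table. A real system would also decode with numeric comparisons per code length rather than the dictionary lookup used here.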
Pages: 891 - 903
Page count: 13
Related papers
50 in total
  • [1] OPTOELECTRONIC FULL-TEXT RETRIEVAL-SYSTEM
    KIM, YW
    BERRA, PB
    OPTICAL ENGINEERING, 1992, 31 (05) : 906 - 914
  • [2] A SYSTEMATIC-APPROACH TO COMPRESSING A FULL-TEXT RETRIEVAL-SYSTEM
    BOOKSTEIN, A
    KLEIN, ST
    ZIFF, DA
    INFORMATION PROCESSING & MANAGEMENT, 1992, 28 (06) : 795 - 806
  • [3] THE RESPONSA PROJECT - A FULL-TEXT RETRIEVAL-SYSTEM FOR HEBREW CASE LAW
    BORKO, H
    PROCEEDINGS OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1985, 22 : 367 - 368
  • [4] HECATE: A FULL-TEXT RETRIEVAL SYSTEM FOR SHORT TEXT
    Wang, Song
    Xiong, Yongping
    PROCEEDINGS OF THE 2016 4TH INTERNATIONAL CONFERENCE ON ADVANCED MATERIALS AND INFORMATION TECHNOLOGY PROCESSING (AMITP 2016), 2016, 60 : 395 - 405
  • [5] InfoBee/TR - a full-text retrieval system
    NTT Human Interface Labs
    NTT R&D, 10 (1103-1108):
  • [6] Full-text Retrieval System for Humanities Researches
    Murakawa, Takehiko
    Watagami, Yukiharu
    Utsunomiya, Keigo
    Nakagawa, Masaru
    KNOWLEDGE-BASED SOFTWARE ENGINEERING, 2012, 240 : 118 - +
  • [7] DATA-COMPRESSION IN FULL-TEXT RETRIEVAL-SYSTEMS
    BELL, TC
    MOFFAT, A
    NEVILLMANNING, CG
    WITTEN, IH
    ZOBEL, J
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1993, 44 (09) : 508 - 531
  • [8] FULL-TEXT INFORMATION RETRIEVAL
    FAY, RJ
    LAW LIBRARY JOURNAL, 1971, 64 (02) : 167 - 175
  • [9] Harvesting for full-text retrieval
    Simeoni, F
    Yakici, M
    Neely, S
    Crestani, F
    DIGITAL LIBRARIES: IMPLEMENTING STRATEGIES AND SHARING EXPERIENCES, PROCEEDINGS, 2005, 3815 : 204 - 213
  • [10] FULL-TEXT ONLINE RETRIEVAL
    COLBERT, AW
    ONLINE, 1988, 12 (02): : 91 - 91