A METHOD FOR SEARCH ENGINE RETRIEVAL SYSTEM BASED ON HYBRID TRIE-INVERTED FILE

被引:0
作者
Yang, Hongwei [1 ]
Zhou, Hua [1 ]
机构
[1] Yunnan Univ, Sch Software, Kunming 650021, Peoples R China
来源
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2009), VOLS 1 AND 2 | 2009年
关键词
Search engine; Inverted file; Hybrid Trie; WEB; COMMUNICATION;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Due to the rapid growth in the size of the web, web search engines are facing enormous performance challenges. This paper presents an inverted file method to improve the effectiveness of retrieving documents. It integrates the effects of the document identifier data and document weight data to improve the efficiency of retrieving process. Moreover, we propose the Hybrid Trie-Inverted file (HTI) index because the evaluation of containment queries relies on merge-joining the inverted lists, and then we conduct several experiments to evaluate our proposed approach. The experimental results show that it can get higher performance than traditional Inverted file index.
引用
收藏
页码:1663 / 1668
页数:6
相关论文
共 21 条
  • [1] Scientific research activity and communication measured with cybermetrics indicators
    Aguillo, Isidro F.
    Granadino, Begona
    Ortega, Jose L.
    Prieto, Jose A.
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2006, 57 (10): : 1296 - 1302
  • [2] ARASU A, 2003, ACM T INTERNET TECHN, V1, P2
  • [3] Evolution, continuity, and disappearance of documents on a specific topic on the web: A longitudinal study of "informetrics"
    Bar-Ilan, J
    Peritz, BC
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2004, 55 (11): : 980 - 990
  • [4] BARILAN J, SEARCH ENGINE RESULT
  • [5] BARJAK F, 2008, J AM SOC IN IN PRESS
  • [6] Toward a basic framework for webometrics
    Björneborn, L
    Ingwersen, P
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2004, 55 (14): : 1216 - 1227
  • [7] 'Mini small worlds' of shortest link paths crossing domain boundaries in an academic Web space
    Bjorneborn, Lennart
    [J]. SCIENTOMETRICS, 2006, 68 (03) : 395 - 414
  • [8] Graph structure in the Web
    Broder, A
    Kumar, R
    Maghoul, F
    Raghavan, P
    Rajagopalan, S
    Stata, R
    Tomkins, A
    Wiener, J
    [J]. COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 2000, 33 (1-6): : 309 - 320
  • [9] Chakrabarti S.:., 2003, Mining the Web: Analysis of Hypertext and Semi Structured Data
  • [10] CHO J., 2004, P WORLD WID WEB C MA