A Cooperative Schema between Web Sever and Search Engine for Improving Freshness of Web Repository

被引:1
|
作者
WEN Kun-mei
机构
关键词
search engine; freshness; cooperative schema;
D O I
暂无
中图分类号
TP393.092 [];
学科分类号
080402 ;
摘要
Because the web is huge and web pages are updated frequently, the index maintained by a search engine has to refresh web pages periodically. This is extremely resource consuming because the search engine needs to crawl the web and download web pages to refresh its index. Based on present technologies of web refreshing, we present a cooperative schema between web server and search engine for maintaining freshness of web repository. The web server provides meta-data defined through XML standard to describe web sites. Before updating the web page the crawler visits the meta-data files. If the meta-data indicates that the page is not modified, then the crawler will not update it. So this schema can save bandwidth resource. A primitive model based on the schema is implemented. The cost and efficiency of the schema are analyzed.
引用
收藏
页码:11 / 14
页数:4
相关论文
共 50 条
  • [1] A cooperative schema between web sever and search engine for improving freshness of web repository
    College of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
    Wuhan Univ J Nat Sci, 2006, 1 (11-14):
  • [2] The freshness of web search engine databases
    Lewandowski, D
    Wahlig, H
    Meyer-Bautor, G
    JOURNAL OF INFORMATION SCIENCE, 2006, 32 (02) : 131 - 148
  • [3] EMACrawler: web search engine database freshness optimization
    Alanoglu, Zuelfue
    Akcayol, M. Ali
    JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2024, 27 (06):
  • [4] Internet search engine freshness by web server help
    Gupta, V
    Campbell, R
    2001 SYMPOSIUM ON APPLICATIONS AND THE INTERNET, PROCEEDINGS, 2001, : 113 - 119
  • [5] Analysis of Web freshness strategies and its improvement in search engine
    Wen, Kunmei
    Lu, Zhengding
    Ye, Weiguo
    Jin, Li
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2002, 30 (12):
  • [6] A three-year study on the freshness of web search engine databases
    Lewandowski, Dirk
    JOURNAL OF INFORMATION SCIENCE, 2008, 34 (06) : 817 - 831
  • [7] Cooperation schemes between a Web server and a Web search engine
    Castillo, C
    FIRST LATIN AMERICAN WEB CONGRESS, PROCEEDINGS, 2003, : 212 - 213
  • [8] The Infocious Web search engine: Improving Web searching through linguistic analysis
    Ntoulas, Alexandros
    Chao, Gerald
    Cho, Junghoo
    Journal of Digital Information Management, 2007, 5 (05): : 277 - 291
  • [9] Using web archive for improving search engine results
    Jatowt, A
    Kawail, Y
    Tanaka, K
    FRONTIERS OF WWW RESEARCH AND DEVELOPMENT - APWEB 2006, PROCEEDINGS, 2006, 3841 : 893 - 898
  • [10] Capturing Page Freshness for Web Search
    Dai, Na
    Davison, Brian D.
    SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 871 - 872