A Cooperative Schema between Web Sever and Search Engine for Improving Freshness of Web Repository

被引:1
|
作者
WEN Kun-mei
机构
关键词
search engine; freshness; cooperative schema;
D O I
暂无
中图分类号
TP393.092 [];
学科分类号
080402 ;
摘要
Because the web is huge and web pages are updated frequently, the index maintained by a search engine has to refresh web pages periodically. This is extremely resource consuming because the search engine needs to crawl the web and download web pages to refresh its index. Based on present technologies of web refreshing, we present a cooperative schema between web server and search engine for maintaining freshness of web repository. The web server provides meta-data defined through XML standard to describe web sites. Before updating the web page the crawler visits the meta-data files. If the meta-data indicates that the page is not modified, then the crawler will not update it. So this schema can save bandwidth resource. A primitive model based on the schema is implemented. The cost and efficiency of the schema are analyzed.
引用
收藏
页码:11 / 14
页数:4
相关论文
共 50 条
  • [21] Web Search Engine Research
    Smith, Jill A.
    LIBRARY QUARTERLY, 2014, 84 (02): : 250 - 252
  • [22] Hierarchical structural approach to improving the browsability of Web search engine results
    Cui, H
    Zaïane, OR
    12TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2001, : 956 - 960
  • [23] Optimization of Web Search Engine and Its Application to Web Mining
    CHEN Hao1
    2. Software School
    3. Department of Computer Science and Technology
    WuhanUniversityJournalofNaturalSciences, 2009, 14 (02) : 115 - 118
  • [24] Search Engine for Amharic Web Content
    Redwan, Hassen
    Mindaye, Tessema
    Atnafu, Solomon
    2009 AFRICON, VOLS 1 AND 2, 2009, : 630 - 635
  • [25] PyThinSearch: A Simple Web Search Engine
    Mirzal, Andri
    CISIS: 2009 INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, VOLS 1 AND 2, 2009, : 1 - 8
  • [26] Web search engine based on DNS
    Wang Liang
    Guo Yi-Ping
    Fang Ming
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2007, 30 (02) : 466 - 478
  • [27] ExpertRec: A Collaborative Web Search Engine
    Sun, Jingyu
    Chen, Junjie
    Yu, Xueli
    Zhong, Ning
    WEB INFORMATION SYSTEMS AND MINING, PT II, 2011, 6988 : 385 - +
  • [28] MediCrawl - A Web Search Engine For Diseases
    Trivedi, Devharsh
    Gopalakrishnan, Vaishnavi
    2021 IEEE 11TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2021, : 148 - 157
  • [29] DEWS: A Decentralized Engine for Web Search
    Ahmed, Reaz
    Bari, Md Faizul
    Haque, Rakibul
    Boutaba, Raouf
    Mathieu, Bertrand
    2014 10TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM), 2014, : 254 - 259
  • [30] Web searching on the vivisimo search engine
    Koshman, Sherry
    Spink, Amanda
    Jansen, Bernard J.
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2006, 57 (14): : 1875 - 1887