A Novel Architecture for Search Engine using Domain Based Web Log Data

Cited by: 1
Authors
Sharma, Prem [1]
Yadav, Divakar [2]
Affiliations
[1] Veer Madho Singh Bhandari Uttarakhand Tech Univ, Comp Sci & Engn, Sudhowala, India
[2] Indira Gandhi Natl Open Univ, Sch Comp & Informat Sci, New Delhi, India
Keywords
Search engine; information retrieval; web usage mining; content mining; RANKING; USAGE;
DOI
10.34028/iajit/20/1/10
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Search engines, as information retrieval tools, are nowadays the main source of information for users. For every query, the search engine explores its repository and/or indexer to find the documents/URLs relevant to that query. Page ranking algorithms rank the Uniform Resource Locators (URLs) according to their relevance to the user's query. Analysis shows that many of the queries submitted by users to search engines are duplicates, so there is scope to improve search engine performance by reducing the effort spent on duplicate queries. In this paper, a proxy server is created that stores the search results of user queries in a web log. The proposed proxy server uses this web log to return results faster when a duplicate query is submitted again. The proposed architecture was tested on ten duplicate user queries and the results were found to be promising. If a duplicate query is found in the web log at the proxy server, the proxy returns all relevant web pages from a particular domain instead of from the entire database. This reduces the perceived latency for duplicate queries and also improves precision and accuracy up to 81.8% and 99%, respectively, for all duplicate user queries.
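The duplicate-query idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the names `QueryLogProxy`, `normalize`, and `fake_backend` are hypothetical, the "web log" is modeled as an in-memory dictionary, and the domain-based restriction of results is not modeled at all. The sketch only shows how a proxy might answer a repeated query from its log instead of contacting the search engine again.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


def normalize(query: str) -> str:
    """Lowercase and collapse whitespace so trivially different duplicates match."""
    return " ".join(query.lower().split())


@dataclass
class QueryLogProxy:
    """Hypothetical proxy that stores search results per query in a log."""
    backend_search: Callable[[str], List[str]]  # assumed search-engine call
    web_log: Dict[str, List[str]] = field(default_factory=dict)
    hits: int = 0
    misses: int = 0

    def search(self, query: str) -> List[str]:
        key = normalize(query)
        if key in self.web_log:
            # Duplicate query: answer from the log, skip the backend.
            self.hits += 1
            return self.web_log[key]
        # First occurrence: ask the backend and record the results.
        self.misses += 1
        results = self.backend_search(query)
        self.web_log[key] = results
        return results


def fake_backend(query: str) -> List[str]:
    """Stand-in for a real search engine; returns one made-up URL per term."""
    return [f"https://example.org/{word}" for word in normalize(query).split()]


proxy = QueryLogProxy(backend_search=fake_backend)
first = proxy.search("Web usage mining")         # miss: goes to the backend
second = proxy.search("  web USAGE   mining ")   # hit: served from the log
```

In this sketch the second call never reaches `fake_backend`, which is where the reduced perceived latency for duplicate queries would come from. A real deployment would also need persistence and an expiry policy for stale results, neither of which is described in the abstract.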
Pages: 92-101
Number of pages: 10