Finding Related Search Engine Queries by Web Community Based Query Enrichment

被引:7
|
作者
Li, Lin [1 ,2 ]
Otsuka, Shingo [3 ]
Kitsuregawa, Masaru [4 ]
机构
[1] Univ Tokyo, Dept Informat & Commun Engn, Tokyo, Japan
[2] Wuhan Univ Technol, Sch Comp Sci & Technol, Wuhan 430070, Peoples R China
[3] Natl Inst Mat Sci, Tsukuba, Ibaraki, Japan
[4] Univ Tokyo, Inst Ind Sci, Tokyo, Japan
来源
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2010年 / 13卷 / 1-2期
关键词
relatedness; query enrichment; web access logs; web page archive; web community; RETRIEVAL;
D O I
10.1007/s11280-009-0077-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The conventional approaches of finding related search engine queries rely on the common terms shared by two queries to measure their relatedness. However, search engine queries are usually short and the term overlap between two queries is very small. Using query terms as a feature space cannot accurately estimate relatedness. Alternative feature spaces are needed to enrich the term based search queries. In this paper, given a search query, first we extract the Web pages accessed by users from Japanese Web access logs which store the users individual and collective behavior. From these accessed Web pages we usually can get two kinds of feature spaces, i.e, content-sensitive (e.g., nouns) and content-ignorant (e.g., URLs), to enrich the expressions of search queries. Then, the relatedness between search queries can be estimated on their enriched expressions. Our experimental results show that the URL feature space produces much lower precision scores than the noun feature space which, however, is not applicable in non-text pages, dynamic pages and so on. It is crucial to improve the quality of the URL (content-ignorant) feature space since it is generally available in all types of Web pages. We propose a novel content-ignorant feature space, called Web community which is created from a Japanese Web page archive by exploiting link analysis. Experimental results show that the proposed Web community feature space generates much better results than the URL feature space.
引用
收藏
页码:121 / 142
页数:22
相关论文
共 50 条
  • [1] Finding Related Search Engine Queries by Web Community Based Query Enrichment
    Lin Li
    Shingo Otsuka
    Masaru Kitsuregawa
    World Wide Web, 2010, 13 : 121 - 142
  • [2] Deriving query intents from web search engine queries
    Lewandowski, Dirk
    Drechsler, Jessica
    von Mach, Sonja
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2012, 63 (09): : 1773 - 1788
  • [3] Exploiting query repetition and regularity in an adaptive community-based Web search engine
    Smyth, B
    Balfe, E
    Freyne, J
    Briggs, P
    Coyle, M
    Boydell, O
    USER MODELING AND USER-ADAPTED INTERACTION, 2004, 14 (05) : 383 - 423
  • [4] Exploiting Query Repetition and Regularity in an Adaptive Community-Based Web Search Engine
    Barry Smyth
    Evelyn Balfe
    Jill Freyne
    Peter Briggs
    Maurice Coyle
    Oisin Boydell
    User Modeling and User-Adapted Interaction, 2004, 14 : 383 - 423
  • [5] Query minig for community based Web search
    Balfe, E
    Smyth, B
    IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2004), PROCEEDINGS, 2004, : 594 - 598
  • [6] Mining related queries from web search engine query logs using an improved association rule mining model
    Shi, Xiaodong
    Yang, Christopher C.
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2007, 58 (12): : 1871 - 1883
  • [7] Towards Web Search by Sentence Queries: Asking the Web for Query Substitutions
    Yamamoto, Yusuke
    Tanaka, Katsumi
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT II, 2011, 6588 : 83 - 92
  • [8] Investigating query bursts in a web search engine
    Subašić, Ilija
    Castillo, Carlos
    Web Intelligence and Agent Systems, 2013, 11 (02): : 107 - 124
  • [9] Query Optimizing on a Decentralized Web Search Engine
    Wang, Daze
    Zhou, Ying
    Davis, Joseph
    APPLIED COMPUTING 2007, VOL 1 AND 2, 2007, : 878 - 879
  • [10] Search Engine Pictures: Empirical Analysis of a Web Search Engine Query Log
    Shoeleh, Farzaneh
    Zahedi, Mohammad Sadegh
    Farhoodi, Mojgan
    2017 3RD INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), 2017, : 90 - 95