Efficient Top-k Document Retrieval Using a Term-Document Binary Matrix

被引:0
|
作者
Fujita, Etsuro [1 ]
Oyama, Keizo [1 ]
机构
[1] Grad Univ Adv Studies SOKENDAI, Tokyo, Japan
来源
INFORMATION RETRIEVAL TECHNOLOGY | 2011年 / 7097卷
关键词
web search engine; top-k query processing; early pruning; early termination; term-document binary matrix;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Current web search engines perform well for "navigational queries." However, due to their use of simple conjunctive Boolean filters, such engines perform poorly for "informational queries." Informational queries would be better handled by a web search engine using an informational retrieval model along with a combination of enhancement techniques such as query expansion and relevance feedback, and the realization of such a engine requires a method to prosess the model efficiently. In this paper, we describe a novel extension of an existing top-k query processing technique. We add a simple data structure called a "term-document binary matrix," resulting in more efficient evaluation of top-k queries even when the queries have been expanded. We show on the basis of experimental evaluation using the TREC GOV2 data set and expanded versions of the evaluation queries attached to this data set that the expanded technique achieves significant performance gains over existing techniques.
引用
收藏
页码:293 / 302
页数:10
相关论文
共 13 条
  • [1] Efficient Top-k Document Retrieval for Long Queries Using Term-Document Binary Matrix - Pursuit of Enhanced Informational Search on the Web
    Fujita, Etsuro
    Oyama, Keizo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (05): : 1016 - 1028
  • [2] Faster Top-k Document Retrieval Using Block-Max Indexes
    Ding, Shuai
    Suel, Torsten
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 993 - 1002
  • [3] Efficient Top-k Retrieval on Massive Data
    Han, Xixian
    Li, Jianzhong
    Gao, Hong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (10) : 2687 - 2699
  • [4] Efficient Approximate Top-k Query Algorithm Using Cube Index
    Chen, Dongqu
    Sun, Guang-Zhong
    Gong, Neil Zhenqiang
    WEB TECHNOLOGIES AND APPLICATIONS, 2011, 6612 : 155 - 167
  • [5] Efficient Top-k Dominating Computation on Massive Data
    Han, Xixian
    Li, Jianzhong
    Gao, Hong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (06) : 1199 - 1211
  • [6] Efficient Top-k Query Answering through its Top-N Rewritings Using Views
    Labbadi, Wissem
    Akaichi, Jalel
    PIKM'15: PROCEEDINGS OF THE 8TH PH.D. WORKSHOP IN INFORMATION AND KNOWLEDGE MANAGEMENT, 2015, : 35 - 42
  • [7] An efficient top-k query processing framework in mobile sensor networks
    Yang, Heejung
    Chung, Chin-Wan
    Kim, Myoung Ho
    DATA & KNOWLEDGE ENGINEERING, 2016, 102 : 78 - 95
  • [8] TopX:: efficient and versatile top-k query processing for semistructured data
    Theobald, Martin
    Bast, Holger
    Majumdar, Debapriyo
    Schenkel, Ralf
    Weikum, Gerhard
    VLDB JOURNAL, 2008, 17 (01): : 81 - 115
  • [9] TKEP: An efficient top-k query processing algorithm on massive data
    Han X.-X.
    Yang D.-H.
    Li J.-Z.
    Jisuanji Xuebao/Chinese Journal of Computers, 2010, 33 (08): : 1405 - 1417
  • [10] Top-k Query Processing for Combinatorial Objects Using Euclidean Distance
    Suzuki, Takanobu
    Takasu, Atsuhiro
    Adachi, Jun
    PROCEEDINGS OF THE 15TH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM (IDEAS '11), 2011, : 209 - 213