Optimizing top-k selection queries over multimedia repositories

被引:36
|
作者
Chaudhuri, S [1 ]
Gravano, L
Marian, A
机构
[1] Microsoft Corp, Res, 1 Microsoft Way, Redmond, WA 98052 USA
[2] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
关键词
top-k query processing; multimedia databases; information search; information retrieval;
D O I
10.1109/TKDE.2004.30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Repositories of multimedia objects having multiple types of attributes ( e. g., image, text) are becoming increasingly common. A query on these attributes will typically request not just a set of objects, as in the traditional relational query model ( filtering), but also a grade of match associated with each object, which indicates how well the object matches the selection condition ( ranking). Furthermore, unlike in the relational model, users may just want the k top-ranked objects for their selection queries for a relatively small k. In addition to the differences in the query model, another peculiarity of multimedia repositories is that they may allow access to the attributes of each object only through indexes. In this paper, we investigate how to optimize the processing of top-k selection queries over multimedia repositories. The access characteristics of the repositories and the above query model lead to novel issues in query optimization. In particular, the choice of the indexes used to search the repository strongly influences the cost of processing the filtering condition. We define an execution space that is search-minimal, i.e., the set of indexes searched is minimal. Although the general problem of picking an optimal plan in the search-minimal execution space is NP-hard, we present an efficient algorithm that solves the problem optimally with respect to our cost model and execution space when the predicates in the query are independent. We also show that the problem of optimizing top-k selection queries can be viewed, in many cases, as that of evaluating more traditional selection conditions. Thus, both problems can be viewed together as an extended filtering problem to which techniques of query processing and optimization may be adapted.
引用
收藏
页码:992 / 1009
页数:18
相关论文
共 38 条
  • [21] Efficient Approximate Top-k Query Algorithm Using Cube Index
    Chen, Dongqu
    Sun, Guang-Zhong
    Gong, Neil Zhenqiang
    WEB TECHNOLOGIES AND APPLICATIONS, 2011, 6612 : 155 - 167
  • [22] TopX:: efficient and versatile top-k query processing for semistructured data
    Theobald, Martin
    Bast, Holger
    Majumdar, Debapriyo
    Schenkel, Ralf
    Weikum, Gerhard
    VLDB JOURNAL, 2008, 17 (01): : 81 - 115
  • [23] SPARK2: Top-k Keyword Query in Relational Databases
    Luo, Yi
    Wang, Wei
    Lin, Xuemin
    Zhou, Xiaofang
    Wang, Jianmin
    Li, Keqiu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (12) : 1763 - 1780
  • [24] TEII: Topic enhanced inverted index for top-k document retrieval
    Jiang, Di
    Leung, Kenneth Wai-Ting
    Yang, Lingxiao
    Ng, Wilfred
    KNOWLEDGE-BASED SYSTEMS, 2015, 89 : 346 - 358
  • [25] Efficient Top-k Query Answering through its Top-N Rewritings Using Views
    Labbadi, Wissem
    Akaichi, Jalel
    PIKM'15: PROCEEDINGS OF THE 8TH PH.D. WORKSHOP IN INFORMATION AND KNOWLEDGE MANAGEMENT, 2015, : 35 - 42
  • [26] Fast Top-K Path-Based Relevance Query on Massive Graphs
    Khemmarat, Samamon
    Gao, Lixin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (05) : 1189 - 1202
  • [27] A Candidate Filtering Mechanism for Fast Top-K Query Processing on Modern CPUs
    Dimopoulos, Constantinos
    Nepomnyachiy, Sergey
    Suel, Torsten
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 723 - 732
  • [28] Waves: a fast multi-tier top-k query processing algorithm
    Daoud, Caio Moura
    de Moura, Edleno Silva
    Fernandes, David
    da Silva, Altigran Soares
    Rossi, Cristian
    Carvalho, Andre
    INFORMATION RETRIEVAL JOURNAL, 2017, 20 (03): : 292 - 316
  • [29] Waves: a fast multi-tier top-k query processing algorithm
    Caio Moura Daoud
    Edleno Silva de Moura
    David Fernandes
    Altigran Soares da Silva
    Cristian Rossi
    Andre Carvalho
    Information Retrieval Journal, 2017, 20 : 292 - 316
  • [30] Faster Top-k Document Retrieval Using Block-Max Indexes
    Ding, Shuai
    Suel, Torsten
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 993 - 1002