Optimizing top-k selection queries over multimedia repositories

Cited by: 36
Authors
Chaudhuri, S [1]
Gravano, L
Marian, A
Affiliations
[1] Microsoft Corp, Res, 1 Microsoft Way, Redmond, WA 98052 USA
[2] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
Keywords
top-k query processing; multimedia databases; information search; information retrieval
DOI
10.1109/TKDE.2004.30
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Repositories of multimedia objects having multiple types of attributes (e.g., image, text) are becoming increasingly common. A query on these attributes will typically request not just a set of objects, as in the traditional relational query model (filtering), but also a grade of match associated with each object, which indicates how well the object matches the selection condition (ranking). Furthermore, unlike in the relational model, users may just want the k top-ranked objects for their selection queries for a relatively small k. In addition to the differences in the query model, another peculiarity of multimedia repositories is that they may allow access to the attributes of each object only through indexes. In this paper, we investigate how to optimize the processing of top-k selection queries over multimedia repositories. The access characteristics of the repositories and the above query model lead to novel issues in query optimization. In particular, the choice of the indexes used to search the repository strongly influences the cost of processing the filtering condition. We define an execution space that is search-minimal, i.e., the set of indexes searched is minimal. Although the general problem of picking an optimal plan in the search-minimal execution space is NP-hard, we present an efficient algorithm that solves the problem optimally with respect to our cost model and execution space when the predicates in the query are independent. We also show that the problem of optimizing top-k selection queries can be viewed, in many cases, as that of evaluating more traditional selection conditions. Thus, both problems can be viewed together as an extended filtering problem to which techniques of query processing and optimization may be adapted.
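The query model in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: the toy repository, the function names, and the use of the minimum to combine per-attribute grades are all assumptions made for illustration. The sketch ranks objects by grade of match, returns the k best, and then shows that the same answer can be obtained from a filter condition of the form "grade at least the k-th best grade", which is the sense in which a top-k query can be viewed as a more traditional selection.

```python
import heapq

# Hypothetical toy repository: object id -> grade of match per attribute, in [0, 1].
REPOSITORY = {
    "o1": {"image": 0.9, "text": 0.4},
    "o2": {"image": 0.7, "text": 0.8},
    "o3": {"image": 0.2, "text": 0.9},
    "o4": {"image": 0.6, "text": 0.6},
}


def grade(obj_grades, attributes):
    """Overall grade for a conjunction of predicates.

    Assumption: the conjunction is scored as the minimum of the
    per-attribute grades; the abstract does not fix a scoring function.
    """
    return min(obj_grades[a] for a in attributes)


def top_k(repo, attributes, k):
    """Ranking query: the k objects with the highest overall grade."""
    scored = ((grade(g, attributes), oid) for oid, g in repo.items())
    return heapq.nlargest(k, scored)


def filter_at_least(repo, attributes, threshold):
    """Filtering query: every object whose overall grade meets the threshold."""
    hits = []
    for oid, g in repo.items():
        score = grade(g, attributes)
        if score >= threshold:
            hits.append((score, oid))
    return sorted(hits, reverse=True)


best = top_k(REPOSITORY, ["image", "text"], k=2)
cutoff = best[-1][0]  # grade of the k-th ranked object
as_filter = filter_at_least(REPOSITORY, ["image", "text"], cutoff)
```

In this toy data the filter with the cutoff grade returns exactly the two top-ranked objects. The sketch scans every object, whereas the abstract stresses that a repository may expose attributes only through indexes, so a real plan would have to choose which indexes to search.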
Pages: 992 - 1009
Number of pages: 18
Related papers
38 in total
  • [1] Top-k queries over web applications
    Daniel Deutch
    Tova Milo
    Neoklis Polyzotis
    The VLDB Journal, 2013, 22 : 519 - 542
  • [2] Top-k queries over web applications
    Deutch, Daniel
    Milo, Tova
    Polyzotis, Neoklis
    VLDB JOURNAL, 2013, 22 (04): 519 - 542
  • [3] Top-k selection queries over relational databases: Mapping strategies and performance evaluation
    Bruno, N
    Chaudhuri, S
    Gravano, L
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2002, 27 (02): 153 - 187
  • [4] Evaluating top-k queries over web-accessible databases
    Marian, A
    Bruno, N
    Gravano, L
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2004, 29 (02): 319 - 362
  • [5] Optimizing top-k queries for middleware access: A unified cost-based approach
    Hwang, Seung-Won
    Chang, Kevin Chen-Chuan
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2007, 32 (01):
  • [6] Exact Top-K Queries in Wireless Sensor Networks
    Malhotra, Baljeet
    Nascimento, Mario A.
    Nikolaidis, Ioanis
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (10) : 1513 - 1525
  • [7] Efficient processing of exact top-k queries over disk-resident sorted lists
    Pang, HweeHwa
    Ding, Xuhua
    Zheng, Baihua
    VLDB JOURNAL, 2010, 19 (03): 437 - 456
  • [8] Parallel Strategies for the Execution of Top-k Queries with MaxScore on GPUs
    Gaioso, Roussian
    Guardia, Helio
    Gil-Costa, Veronica
    Senger, Hermes
    2019 31ST INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2019), 2019, : 104 - 111
  • [9] A Scalable Algorithm for Answering Top-K Queries Using Cached Views
    Labbadi, Wissem
    Akaichi, Jalel
    FLEXIBLE QUERY ANSWERING SYSTEMS 2015, 2016, 400 : 257 - 270
  • [10] Top-K Entity Units Retrieval Over Big Data
    Zhang, Da
    Kabuka, Mansur R.
    WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, : 1269 - 1272