Optimizing top-k selection queries over multimedia repositories

被引:36
|
作者
Chaudhuri, S [1 ]
Gravano, L
Marian, A
机构
[1] Microsoft Corp, Res, 1 Microsoft Way, Redmond, WA 98052 USA
[2] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
关键词
top-k query processing; multimedia databases; information search; information retrieval;
D O I
10.1109/TKDE.2004.30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Repositories of multimedia objects having multiple types of attributes ( e. g., image, text) are becoming increasingly common. A query on these attributes will typically request not just a set of objects, as in the traditional relational query model ( filtering), but also a grade of match associated with each object, which indicates how well the object matches the selection condition ( ranking). Furthermore, unlike in the relational model, users may just want the k top-ranked objects for their selection queries for a relatively small k. In addition to the differences in the query model, another peculiarity of multimedia repositories is that they may allow access to the attributes of each object only through indexes. In this paper, we investigate how to optimize the processing of top-k selection queries over multimedia repositories. The access characteristics of the repositories and the above query model lead to novel issues in query optimization. In particular, the choice of the indexes used to search the repository strongly influences the cost of processing the filtering condition. We define an execution space that is search-minimal, i.e., the set of indexes searched is minimal. Although the general problem of picking an optimal plan in the search-minimal execution space is NP-hard, we present an efficient algorithm that solves the problem optimally with respect to our cost model and execution space when the predicates in the query are independent. We also show that the problem of optimizing top-k selection queries can be viewed, in many cases, as that of evaluating more traditional selection conditions. Thus, both problems can be viewed together as an extended filtering problem to which techniques of query processing and optimization may be adapted.
引用
收藏
页码:992 / 1009
页数:18
相关论文
共 38 条
  • [31] Exact and Approximate Generic Multi-criteria Top-k Query Processing
    Badr, Mehdi
    Vodislav, Dan
    TRANSACTIONS ON LARGE-SCALE DATA- AND KNOWLEDGE-CENTERED SYSTEMS XVIII: SPECIAL ISSUE ON DATABASE- AND EXPERT-SYSTEMS APPLICATIONS, 2015, 8980 : 53 - 79
  • [32] Fast top-k preserving query processing using two-tier indexes
    Daoud, Caio Moura
    de Moura, Edleno Silva
    Carvalho, Andre
    da Silva, Altigran Soares
    Fernandes, David
    Rossi, Cristian
    INFORMATION PROCESSING & MANAGEMENT, 2016, 52 (05) : 855 - 872
  • [33] A Model-based Keyword Search Approach for Detecting Top-k Effective Answers
    Ghanbarpour, Asieh
    Naderi, Hassan
    COMPUTER JOURNAL, 2019, 62 (03): : 377 - 393
  • [34] Top-k Query Processing and Malicious Node Identification Based on Node Grouping in MANETs
    Tsuda, Takuji
    Komai, Yuka
    Hara, Takahiro
    Nishio, Shojiro
    IEEE ACCESS, 2016, 4 : 993 - 1007
  • [35] Efficient Top-k Document Retrieval Using a Term-Document Binary Matrix
    Fujita, Etsuro
    Oyama, Keizo
    INFORMATION RETRIEVAL TECHNOLOGY, 2011, 7097 : 293 - 302
  • [36] A Review on Top-k Query Processing and Prevention of Malicious Nodes Performing DRA and FNA In MANETs
    Mathew, Melseena
    Yomas, Jerrin
    2018 INTERNATIONAL CONFERENCE ON CONTROL, POWER, COMMUNICATION AND COMPUTING TECHNOLOGIES (ICCPCCT), 2018, : 288 - 291
  • [37] Top-k query processing in mobile-P2P networks using economic incentive schemes
    Padhariya, Nilesh
    Mondal, Anirban
    Madria, Sanjay Kumar
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2016, 9 (04) : 731 - 751
  • [38] Dissimilarity-constrained node attribute coverage diversification for novelty-enhanced top-k search in large attributed networks
    Meng, Zaiqiao
    Shen, Hong
    KNOWLEDGE-BASED SYSTEMS, 2018, 150 : 85 - 94