Dynamic optimization of queries in pivot-based indexing

被引:5
作者
Bratsberg, Svein Erik [1 ]
Hetland, Magnus Lie [1 ]
机构
[1] Norwegian Univ Sci & Technol, Dept Comp & Informat Sci, N-7491 Trondheim, Norway
关键词
Similarity search; Pivot-based indexing; Database trees; Optimized query processing;
D O I
10.1007/s11042-010-0614-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper evaluates the use of standard database indexes and query processing as a way to do metric indexing in the LAESA approach. By utilizing B-trees and R-trees as pivot-based indexes, we may use well-known optimization techniques from the database field within metric indexing and search. The novelty of this paper is that we use a cost-based approach to dynamically evaluate which and how many pivots to use in the evaluation of each query. By a series of measurements using our database prototype we are able to evaluate the performance of this approach. Compared to using all available pivots for filtering, the optimized approach gives half the response times for main memory data, but much more varied results for disk resident data. However, by use of the cost model we are able to dynamically determine when to bypass the indexes and simply perform a sequential scan of the base data. The conclusion of this evaluation is that it is beneficial to create many pivots, but to use only the most selective ones during evaluation of each query. R-trees give better performance than B-trees when utilizing all pivots, but when being able to dynamically select the best pivots, B-trees often provide better performance.
引用
收藏
页码:261 / 275
页数:15
相关论文
共 24 条
  • [1] Baioco GB, 2007, APPLIED COMPUTING 2007, VOL 1 AND 2, P527, DOI 10.1145/1244002.1244123
  • [2] Quadratic form: A robust metric for quantitative comparison of flow cytometric histograms
    Bernas, Tytus
    Asem, Elikplimi K.
    Robinson, J. Paul
    Rajwa, Bartek
    [J]. CYTOMETRY PART A, 2008, 73A (08) : 715 - 726
  • [3] Beyer K, 1999, LECT NOTES COMPUT SC, V1540, P217
  • [4] Pivot selection techniques for proximity searching in metric spaces
    Bustos, B
    Navarro, G
    Chávez, E
    [J]. PATTERN RECOGNITION LETTERS, 2003, 24 (14) : 2357 - 2366
  • [5] A dynamic pivot selection technique for similarity search
    Bustos, Benjamin
    Pedreira, Oscar
    Brisaboa, Nieves
    [J]. SISAP 2008: FIRST INTERNATIONAL WORKSHOP ON SIMILARITY SEARCH AND APPLICATIONS, PROCEEDINGS, 2008, : 105 - +
  • [6] Chavez E., 1999, 6th International Symposium on String Processing and Information Retrieval. 5th International Workshop on Groupware (Cat. No.PR00268), P38, DOI 10.1109/SPIRE.1999.796576
  • [7] Searching in metric spaces
    Chávez, E
    Navarro, G
    BaezaYates, R
    Marroquín, JL
    [J]. ACM COMPUTING SURVEYS, 2001, 33 (03) : 273 - 321
  • [8] Ciaccia P., 1998, Proceedings of the Seventeenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems. PODS 1998, P59, DOI 10.1145/275487.275495
  • [9] Figueroa K, 2006, LECT NOTES COMPUT SC, V4007, P279
  • [10] Figuerora K, 2010, SISAP METRIC SPACE L