Spatial indexing of high-dimensional data based on relative approximation

被引:0
|
作者
Yasushi Sakurai
Masatoshi Yoshikawa
Shunsuke Uemura
Haruhiko Kojima
机构
[1] NTT Cyber Space Laboratories,
[2] 1-1 Hikari-no-oka,undefined
[3] Yokosuka,undefined
[4] Kanagawa 239-0847,undefined
[5] Japan; e-mail: sakurai.yasushi@lab.ntt.co.jp,undefined
[6] kojima@aether.hil.ntt.co.jp ,undefined
[7] Nara Institute of Science and Technology,undefined
[8] 8916-5 Takayama,undefined
[9] Ikoma,undefined
[10] Nara 630-0101,undefined
[11] Japan; e-mail: }yosikawa,undefined
[12] uemura}@is.aist-nara.ac.jp ,undefined
来源
The VLDB Journal | 2002年 / 11卷
关键词
Key words:High-dimensional data – Similarity search – Relative approximation;
D O I
暂无
中图分类号
学科分类号
摘要
We propose a novel index structure, the A-tree (approximation tree), for similarity searches in high-dimensional data. The basic idea of the A-tree is the introduction of virtual bounding rectangles (VBRs) which contain and approximate MBRs or data objects. VBRs can be represented quite compactly and thus affect the tree configuration both quantitatively and qualitatively. First, since tree nodes can contain a large number of VBR entries, fanout becomes large, which increases search speed. More importantly, we have a free hand in arranging MBRs and VBRs in the tree nodes. Each A-tree node contains an MBR and its children VBRs. Therefore, by fetching an A-tree node, we can obtain information on the exact position of a parent MBR and the approximate position of its children. We have performed experiments using both synthetic and real data sets. For the real data sets, the A-tree outperforms the SR-tree and the VA-file in all dimensionalities up to 64 dimensions, which is the highest dimension in our experiments. Additionally, we propose a cost model for the A-tree. We verify the validity of the cost model for synthetic and real data sets.
引用
收藏
页码:93 / 108
页数:15
相关论文
共 50 条
  • [1] Spatial indexing of high-dimensional data based on relative approximation
    Sakurai, Y
    Yoshikawa, M
    Uemura, S
    Kojima, H
    VLDB JOURNAL, 2002, 11 (02): : 93 - 108
  • [2] An indexing technique using relative approximation for high-dimensional data
    Sakurai, Y., 1600, John Wiley and Sons Inc. (34):
  • [3] Approximate retrieval of high-dimensional data by spatial indexing
    Shinohara, T
    An, JY
    Ishizaka, H
    DISCOVERY SCIENCE, 1998, 1532 : 141 - 149
  • [4] High-Dimensional Indexing by Sparse Approximation
    Borges, Pedro
    Mourao, Andre
    Magalhaes, Joao
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 163 - 170
  • [5] Spatial indexing by virtual bounding rectangles for high-dimensional data
    Sakurai, Y
    Yoshikawa, M
    Uemura, S
    INFORMATION ORGANIZATION AND DATABASES: FOUNDATIONS OF DATA ORGANIZATION, 2000, 579 : 267 - 279
  • [6] Vector Approximation based Indexing for High-Dimensional Multimedia Databases
    Daoudi, I.
    Ouatik, S. E.
    El Kharraz, A.
    Idrissi, K.
    Aboutajdine, D.
    ENGINEERING LETTERS, 2008, 16 (02)
  • [7] A hyperplane based indexing technique for high-dimensional data
    Wang, Guoren
    Zhou, Xiangmin
    Wang, Bin
    Qiao, Baiyou
    Han, Donghong
    INFORMATION SCIENCES, 2007, 177 (11) : 2255 - 2268
  • [8] Interpretable Approximation of High-Dimensional Data
    Potts, Daniel
    Schmischke, Michael
    SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2021, 3 (04): : 1301 - 1323
  • [9] Approximate retrieval of high-dimensional data with L1 metric by spatial indexing
    Shinohara, T
    An, JY
    Ishizaka, H
    NEW GENERATION COMPUTING, 2000, 18 (01) : 39 - 47
  • [10] Approximate retrieval of high-dimensional data withL1 metric by spatial indexing
    Takeshi Shinohara
    Jiyuan An
    Hiroki Ishizaka
    New Generation Computing, 2000, 18 : 39 - 47