Implementing KDB-trees to support high-dimensional data

被引:18
作者
Orlandic, R [1 ]
Yu, BG [1 ]
机构
[1] IIT, Dept Comp Sci, Chicago, IL 60616 USA
来源
2001 INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM, PROCEEDINGS | 2001年
关键词
multi-dimensional databases; point access methods; data dimensionality; performance evaluation;
D O I
10.1109/IDEAS.2001.938071
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem of retrieving large volumes of high dimensional data is an important and timely issue in the area of database management. The guiding idea of this paper is to develop a general-purpose point access method that attacks the limitations of KDB-trees in high-dimensional spaces, while preserving their relatively good performance in low-dimensional situations. The proposed structure, called high-dimensional KDB-tree, eliminates downward propagation of splits associated with the original KDB-tree structure, which results in low storage utilization and rapid deterioration of the retrieval performance. Additional improvements in the storage and retrieval performance are achieved by removing certain redundant information from the interior nodes. Experimental results show that, in high-dimensional spaces, the proposed structure outperforms the original KDB-trees by a significant margin, while incurring no loss of performance in low-dimensional spaces. The structure also outperforms two other variants of KDB-trees investigated in the paper.
引用
收藏
页码:58 / 67
页数:2
相关论文
共 21 条
[1]  
AAPL case, 1990, ICSID REV, P590
[2]  
[Anonymous], P ACM SIGMOD INT C M
[3]   EXPECTED BEHAVIOR OF B+-TREES UNDER RANDOM INSERTIONS [J].
BAEZAYATES, RA .
ACTA INFORMATICA, 1989, 26 (05) :439-471
[4]  
Berchtold S, 1996, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, P28
[5]  
Berchtold S., 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073), P577, DOI 10.1109/ICDE.2000.839456
[6]  
Beyer Kevin., 1999, INT C DATABASE THEOR, P217, DOI DOI 10.1007/3-540-49257-7_15
[7]   UBIQUITOUS B-TREE [J].
COMER, D .
COMPUTING SURVEYS, 1979, 11 (02) :121-137
[8]  
FREESTON M, 1995, P ACM SIGMOD INT C M, P80
[9]   Multidimensional access methods [J].
Gaede, V ;
Gunther, O .
ACM COMPUTING SURVEYS, 1998, 30 (02) :170-231
[10]   Efficient indexing of high-dimensional data through dimensionality reduction [J].
Goh, CH ;
Lim, A ;
Ooi, BC ;
Tan, KL .
DATA & KNOWLEDGE ENGINEERING, 2000, 32 (02) :115-130