Efficient Metric All-k-Nearest-Neighbor Search on Datasets Without Any Index

被引:0
作者
Hai-Da Zhang
Zhi-Hao Xing
Lu Chen
Yun-Jun Gao
机构
[1] Zhejiang University,College of Computer Science
[2] The University of New South Wales,School of Computer Science and Engineering
来源
Journal of Computer Science and Technology | 2016年 / 31卷
关键词
all-k-nearest-neighbor search; query processing; metric space;
D O I
暂无
中图分类号
学科分类号
摘要
An all-k-nearest-neighbor (AkNN) query finds k nearest neighbors for each query object. This problem arises naturally in many areas, such as GIS (geographic information system), multimedia retrieval, and recommender systems. To support various data types and flexible distance metrics involved in real applications, we study AkNN retrieval in metric spaces, namely, metric AkNN (MAkNN) search. Consider that the underlying indexes on the query set and the object set may not exist, which is natural in many scenarios. For example, the query set and the object set could be the results of other queries, and thus, the underlying indexes cannot be built in advance. To support MAkNN search on datasets without any underlying index, we propose an efficient disk-based algorithm, termed as Partition-Based MAkNN Algorithm (PMA), which follows a partition-search framework and employs a series of pruning rules for accelerating the search. In addition, we extend our techniques to tackle an interesting variant of MAkNN queries, i.e., metric self-AkNN (MSAkNN) search, where the query set is identical to the object set. Extensive experiments using both real and synthetic datasets demonstrate the effectiveness of our pruning rules and the efficiency of the proposed algorithms, compared with state-of-the-art MAkNN and MSAkNN algorithms.
引用
收藏
页码:1194 / 1211
页数:17
相关论文
共 51 条
[11]  
Sankaranarayanan J(2016)Metric all-knearest-neighbor search IEEE Transactions on Knowledge and Data Engineering 28 98-112
[12]  
Samet H(2001)Searching in metric spaces ACM Computing Surveys 33 273-321
[13]  
Varshney A(2006)Reverse nearest neighbor search in metric spaces IEEE Transactions on Knowledge and Data Engineering 18 1239-1252
[14]  
Nakano K(2012)Exploiting database similarity joins for metric spaces Proceedings of the VLDB Endowment 5 1922-1925
[15]  
Olariu S(2009)Efficient processing of metric skyline queries IEEE Transactions on Knowledge and Data Engineering 21 351-365
[16]  
Wang YR(2003)Pivot selection techniques for proximity searching in metric spaces Pattern Recognition Letters 24 2357-2366
[17]  
Horng SJ(2015)Overcoming the challenge of variety: Big data abstraction, the next evolution of data management for AAL communication systems IEEE Communications Magazine 53 42-47
[18]  
Wu CH(undefined)undefined undefined undefined undefined-undefined
[19]  
Yu C(undefined)undefined undefined undefined undefined-undefined
[20]  
Cui B(undefined)undefined undefined undefined undefined-undefined