Parallel Algorithms for Constructing Range and Nearest-Neighbor Searching Data Structures

被引:14
|
作者
Agarwal, Pankaj K. [1 ]
Fox, Kyle [1 ]
Munagala, Kamesh [1 ]
Nath, Abhinandan [1 ]
机构
[1] Duke Univ, Durham, NC 27706 USA
基金
美国国家科学基金会;
关键词
MODEL; COMPLEXITY; MAPREDUCE;
D O I
10.1145/2902251.2902303
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the massive amounts of data available today, it is common to store and process data using multiple machines. Parallel programming platforms such as MapReduce and its variants are popular frameworks for handling such large data. We present the first provably efficient algorithms to compute, store, and query data structures for range queries and approximate nearest neighbor queries in a popular parallel computing abstraction that captures the salient features of MapReduce and other massively parallel communication (MPC) models. In particular, we describe algorithms for kd-trees, range trees, and BBD-trees that only require O(1) rounds of communication for both preprocessing and querying while staying competitive in terms of running time and workload to their classical counterparts. Our algorithms are randomized, but they can be made deterministic at some increase in their running time and workload while keeping the number of rounds of communication to be constant.
引用
收藏
页码:429 / 440
页数:12
相关论文
共 50 条
  • [11] Scalable Algorithms for Nearest-Neighbor Joins on Big Trajectory Data
    Fang, Yixiang
    Cheng, Reynold
    Tang, Wenbin
    Maniu, Silviu
    Yang, Xuan
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1528 - 1529
  • [12] A HASHING-ORIENTED NEAREST-NEIGHBOR SEARCHING SCHEME
    CHANG, CC
    WU, TC
    PATTERN RECOGNITION LETTERS, 1993, 14 (08) : 625 - 630
  • [13] Fast nearest-neighbor searching for nonlinear signal processing
    Merkwirth, Christian
    Parlitz, Ulrich
    Lauterborn, Werner
    Physical Review E - Statistical Physics, Plasmas, Fluids, and Related Interdisciplinary Topics, 2000, 62 (2 A): : 2089 - 2097
  • [14] Fast nearest-neighbor searching for nonlinear signal processing
    Merkwirth, C
    Parlitz, U
    Lauterborn, W
    PHYSICAL REVIEW E, 2000, 62 (02): : 2089 - 2097
  • [15] REFINEMENTS TO NEAREST-NEIGHBOR SEARCHING IN K-DIMENSIONAL TREES
    SPROULL, RF
    ALGORITHMICA, 1991, 6 (04) : 579 - 589
  • [16] EFFECTIVE ALGORITHMS FOR THE NEAREST-NEIGHBOR METHOD IN THE CLUSTERING PROBLEM
    HATTORI, K
    TORII, Y
    PATTERN RECOGNITION, 1993, 26 (05) : 741 - 746
  • [17] Refinements to nearest-neighbor searching in k-dimensional trees
    Sproull, Robert F.
    Algorithmica (New York), 1991, 6 (04): : 579 - 589
  • [18] NEAREST-NEIGHBOR HEURISTICS IN ACCELERATED ALGORITHMS OF OPTIMIZATION PROBLEMS
    LIN, SC
    HSUEH, HC
    PHYSICA A, 1994, 203 (3-4): : 369 - 380
  • [19] MapReduce Algorithms for the K Group Nearest-Neighbor Query
    Moutafis, Panagiotis
    Garcia-Garcia, Francisco
    Mavrommatis, George
    Vassilakopoulos, Michael
    Corral, Antonio
    Iribarne, Luis
    SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 448 - 455
  • [20] Disk allocation for fast range and nearest-neighbor queries
    Prabhakar, S
    Agrawal, D
    El Abbadi, A
    DISTRIBUTED AND PARALLEL DATABASES, 2003, 14 (02) : 107 - 135