Towards Parallel Spatial Query Processing for Big Spatial Data

被引:38
作者
Zhong, Yunqin [1 ,4 ]
Han, Jizhong [1 ]
Zhang, Tieying [1 ,4 ]
Li, Zhenhua [2 ]
Fang, Jinyun [1 ]
Chen, Guihai [3 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Peking Univ, Beijing, Peoples R China
[3] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[4] Grad Univ Chinese Acad Sci, Beijing, Peoples R China
来源
2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW) | 2012年
关键词
spatial data management; distributed storage; spatial index; spatial query; spatial applications;
D O I
10.1109/IPDPSW.2012.245
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In recent years, spatial applications have become more and more important in both scientific research and industry. Spatial query processing is the fundamental functioning component to support spatial applications. However, the state-of-the-art techniques of spatial query processing are facing significant challenges as the data expand and user accesses increase. In this paper we propose and implement a novel scheme (named VegaGiStore) to provide efficient spatial query processing over big spatial data and numerous concurrent user queries. Firstly, a geography-aware approach is proposed to organize spatial data in terms of geographic proximity, and this approach can achieve high aggregate I/O throughput. Secondly, in order to improve data retrieval efficiency, we design a two-tier distributed spatial index for efficient pruning of the search space. Thirdly, we propose an "indexing + MapReduce" data processing architecture to improve the computation capability of spatial query. Performance evaluations of the real-deployed VegaGiStore system confirm its effectiveness.
引用
收藏
页码:2085 / 2094
页数:10
相关论文
共 19 条
  • [1] [Anonymous], 1994, VLDB J, DOI [10.1007/BF01231602, DOI 10.1007/BF01231602]
  • [2] Brakatsoulas S., 2002, LECT NOTES COMPUTER, P17
  • [3] Bigtable: A distributed storage system for structured data
    Chang, Fay
    Dean, Jeffrey
    Ghemawat, Sanjay
    Hsieh, Wilson C.
    Wallach, Deborah A.
    Burrows, Mike
    Chandra, Tushar
    Fikes, Andrew
    Gruber, Robert E.
    [J]. ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2008, 26 (02):
  • [4] Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
  • [5] PARALLEL DATABASE-SYSTEMS - THE FUTURE OF HIGH-PERFORMANCE DATABASE-SYSTEMS
    DEWITT, D
    GRAY, J
    [J]. COMMUNICATIONS OF THE ACM, 1992, 35 (06) : 85 - 98
  • [6] Data management in location-dependent information services
    Lee, Dik Lun
    Lee, Wang-Chien
    Xu, Jianliang
    Zheng, Baihua
    [J]. IEEE Pervasive Computing, 2002, 1 (03) : 65 - 72
  • [7] SPATIAL SQL - A QUERY AND PRESENTATION LANGUAGE
    EGENHOFER, MJ
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1994, 6 (01) : 86 - 95
  • [8] Multidimensional access methods
    Gaede, V
    Gunther, O
    [J]. ACM COMPUTING SURVEYS, 1998, 30 (02) : 170 - 231
  • [9] Kamel I., 1992, P 1992 ACM SIGMOD IN, P195, DOI [10.1145/141484.130315, DOI 10.1145/141484.130315]
  • [10] Kanth Kothuri Venkata Ravi, 2002, P 2002 ACM SIGMOD IN, P546, DOI [DOI 10.1145/564691.564755, 10.1145/564691.564755]