Global multi-scale grid integer coding and spatial indexing: A novel approach for big earth observation data

被引:18
作者
Lei, Yi [1 ]
Tong, Xiaochong [1 ]
Zhang, Yongsheng [1 ]
Qiu, Chunping [1 ]
Wu, Xiangyu [1 ]
Lai, Guangling [1 ]
Li, He [1 ]
Guo, Congzhou [1 ]
Zhang, Yong [2 ]
机构
[1] Informat Engn Univ, Zhengzhou 450001, Peoples R China
[2] Zhengzhou Zhonghe Jingxuan Informat Technol Co Lt, Zhengzhou 450001, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Multi-scale grid; Integer Coding; Clustering property; Data management; Big earth observation data; Spatial indexing and querying; ALGORITHM;
D O I
10.1016/j.isprsjprs.2020.03.010
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
With the exponentially growing earth observation data of specific sensor-determined resolutions and update frequencies, earth observation has irreversibly arrived in the Big Data era, enabling new insights in science and engineering. With great opportunity comes great challenges regarding efficient and effective data management because earth observation data is of different scales and characterized by complexity in spatial relationships related to the real world. To overcome the challenges is crucial for, for instance, data mining, land surveying, and especially emergency mapping for disaster response. To improve the querying efficiency of big earth observation data, we proposed a novel data management approach: Global Multi-scale Grid Integer Coding and Spatial Indexing. Among our contributions are: (1) proposing Global Multi-scale Grid Integer Coding Model (GMGICM), which presents clustering property in both the scale dimension and spatial dimension, and theoretically facilitates an efficient querying; (2) deliberately applying GMGICM on multi-scale earth observation data spatial indexing, which results in one-dimensional data index, which can be queried using simple B-tree, inversion, and other one-dimensional indexes; (3) designing a strategy to assure the completeness of spatial querying, which is not well solved by existing grid-based coding models. The advantages of our proposed approach have been demonstrated with both simulated and real remote sensing data, with spatial operation 20 times as fast as Geohash and spatial querying 10 times as fast as Oracle Spatial on average. The proposed approach can be easily adapted for three or higher-dimensional earth observation data and bring potential benefit to all big earth observation data analytic projects.
引用
收藏
页码:202 / 213
页数:12
相关论文
共 45 条
[1]  
[Anonymous], 2002, P 2002 ACM SIGMOD IN
[2]  
[Anonymous], 2009, ENCY DATABASE SYSTEM
[3]  
BAYER R, 1997, INT C WORLDW COMP IT
[4]  
Bylinski C., 1989, J FORMALIZED MATH, V1, P9
[5]  
Camara Gilberto, 2016, 5 ACM SIGSPATIAL INT
[6]   Bigtable: A distributed storage system for structured data [J].
Chang, Fay ;
Dean, Jeffrey ;
Ghemawat, Sanjay ;
Hsieh, Wilson C. ;
Wallach, Deborah A. ;
Burrows, Mike ;
Chandra, Tushar ;
Fikes, Andrew ;
Gruber, Robert E. .
ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2008, 26 (02)
[7]   Normalized Difference Flood Index for rapid flood mapping: Taking advantage of EO big data [J].
Cian, Fabio ;
Marconcini, Mattia ;
Ceccato, Pietro .
REMOTE SENSING OF ENVIRONMENT, 2018, 209 :712-730
[8]  
Davis N., 2018, IEEE T INTELL TRANSP, P1
[9]  
Faloutsos C., 1989, Proceedings of the Eighth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, P247, DOI 10.1145/73721.73746
[10]  
Gibin M., 2008, Applied Spatial Analysis and Policy, V1, P85, DOI DOI 10.1007/S12061-008-9005-5