An interactive framework for spatial joins: a statistical approach to data analysis in GIS

Cited by: 2
Authors
Alkobaisi, Shayma [2]
Bae, Wan D. [1]
Vojtechovsky, Petr [3]
Narayanappa, Sada [4]
Affiliations
[1] Univ Wisconsin Stout, Dept Math Stat & Comp Sci, Menomonie, WI USA
[2] United Arab Emirates Univ, Fac Informat Technol, Al Ain, U Arab Emirates
[3] Univ Denver, Dept Math, Denver, CO USA
[4] Jeppesen Inc, Adv Comp Technol, Englewood, CO USA
Keywords
Interactive queries; Spatial join; Join probability; Probabilistic joins; Incremental sampling; Quad-tree; R-tree; GIS
DOI
10.1007/s10707-011-0134-7
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Many Geographic Information Systems (GIS) handle large volumes of geospatial data. Spatial joins over two or more geospatial datasets are common GIS operations for data analysis and decision support. However, evaluating spatial joins can be very time-consuming because of the size of the datasets. In this paper, we propose an interactive framework that provides faster approximate answers to spatial joins. The proposed framework uses two statistical methods: a probabilistic join and a sampling-based join. The probabilistic join method provides a speedup of two orders of magnitude with no correctness guarantee, while the sampling-based method provides an order-of-magnitude improvement over full index-tree joins of the datasets and also provides running confidence intervals. The framework lets users trade off speed against bounded accuracy and thus supports truly interactive data exploration. The two methods are evaluated empirically on real and synthetic datasets.
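The idea of a sampling-based join with running confidence intervals can be illustrated with a minimal sketch. This is not the framework's algorithm (the record gives no implementation details); it assumes plain uniform sampling of rectangle pairs with replacement and a normal-approximation (Wald) interval, and every name in it (`Rect`, `intersects`, `random_rects`, `sampled_join_estimate`, and all parameters) is hypothetical.

```python
import math
import random
from typing import List, Tuple

# Axis-aligned rectangle as (xmin, ymin, xmax, ymax). Hypothetical representation.
Rect = Tuple[float, float, float, float]


def intersects(a: Rect, b: Rect) -> bool:
    """Return True if two axis-aligned rectangles overlap (touching counts)."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def random_rects(count: int, max_side: float = 0.1, seed: int = 0) -> List[Rect]:
    """Generate synthetic rectangles anchored in the unit square (illustration only)."""
    rng = random.Random(seed)
    rects = []
    for _ in range(count):
        x, y = rng.random(), rng.random()
        rects.append((x, y, x + rng.random() * max_side, y + rng.random() * max_side))
    return rects


def sampled_join_estimate(
    r: List[Rect],
    s: List[Rect],
    batch: int = 500,
    max_batches: int = 20,
    target_half_width: float = 0.01,
    z: float = 1.96,  # about 95% confidence under the normal approximation
    seed: int = 0,
) -> List[Tuple[int, float, float]]:
    """Estimate join selectivity incrementally.

    After each batch of sampled pairs, record (samples so far, estimated
    selectivity, confidence-interval half-width). Sampling stops once the
    half-width reaches the target or the batch budget is spent. Note that the
    Wald interval collapses to zero width when no sampled pair intersects,
    so very sparse joins need a more careful interval than this sketch uses.
    """
    rng = random.Random(seed)
    hits = 0
    n = 0
    history = []
    for _ in range(max_batches):
        for _ in range(batch):
            if intersects(rng.choice(r), rng.choice(s)):
                hits += 1
        n += batch
        p = hits / n
        half = z * math.sqrt(p * (1.0 - p) / n)
        history.append((n, p, half))
        if half <= target_half_width:
            break
    return history
```

Multiplying the final selectivity estimate by `len(r) * len(s)` gives an approximate join size, and the recorded history is what a user would watch to decide when the answer is accurate enough to stop.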
Pages: 329-355
Page count: 27