Spatial data extension for Cassandra NoSQL database

被引:26
作者
Ben Brahim M. [1 ]
Drira W. [1 ]
Filali F. [1 ]
Hamdi N. [2 ]
机构
[1] Qatar Mobility Innovations Center, Qatar Science and Technology Park, Doha
[2] University of Carthage, INSAT, Tunis
关键词
Big data; Cassandra DB; Geohash; NoSQL databases; Spatial query;
D O I
10.1186/s40537-016-0045-4
中图分类号
学科分类号
摘要
The big data phenomenon is becoming a fact. Continuous increase of digitization and connecting devices to Internet are making current solutions and services smarter, richer and more personalized. The emergence of the NoSQL databases, like Cassandra, with their massive scalability and high availability encourages us to investigate the management of the stored data within such storage system. In our present work, we harness the geohashing technique to enable spatial queries as extension to Cassandra query language capabilities while preserving the native syntax. The developed framework showed the feasibility of this approach where basic spatial queries are underpinned and the query response time is reduced by up to 70 times for a fairly large area. © 2016, The Author(s).
引用
收藏
相关论文
共 9 条
[1]  
Drira W., Filali F., Ndn-q: an ndn query mechanism for efficient v2x data collection. IEEE 11th annual international conference on sensing, communication, and networking workshops (SECON Workshops), (2014)
[2]  
Lakhshman A., Malik P., Cassandra: a decentralized structured storage system, ACM SIGOPS Operating Syst Rev, 44, 2, pp. 35-40, (2010)
[3]  
Robinson I., Webber J., Eifrem E., Graph databases: new opportunities for connected data, (2015)
[4]  
Moniruzzaman A.B., Hossain S.A., Nosql database: new era of databases for big data analytics—classification, characteristics and comparison, Int J Database Theor Appl, 6, 4, pp. 1-13, (2013)
[5]  
Cuzzocrea A., Song I.Y., Davis K.C., Analytics over large-scale multidimentional data: the big data reveolution, (2011)
[6]  
Samet H., Aref W.G., Spatial data models and query processing. Modern database systems: the object model, interoperability, and beyond, (1994)
[7]  
Malensek M., Pallickara S., Pallickara S. Polygon-based query evaluation over geospatial data using distributed hash tables. UCC ’13 Proceedings of the, IEEE/ACM 6th international conference on utility and cloud computing, 2013, pp. 219-226, (2013)
[8]  
Zhong Y., Han J., Zhang T., Li Z., Fang J., Chen G., Towards parallel spatial query processing for big spatial data, IEEE 26th International parallel and distributed processing symposium workshops and PhD forum, (2012)
[9]  
Arnold T., An entropy maximizing geohash for distributed spatiotemporal database indexing, arXiv:1506.05158v1 [cs.DB], (2015)