BBoxDB streams: scalable processing of multi-dimensional data streams

被引:0
作者
Jan Kristof Nidzwetzki
Ralf Hartmut Güting
机构
[1] FernUniversität Hagen,Faculty of Mathematics and Computer Science
来源
Distributed and Parallel Databases | 2022年 / 40卷
关键词
Distributed stream processing; Spatial stream processing; Continuous queries; Distributed datastore; Multi-dimensional data;
D O I
暂无
中图分类号
学科分类号
摘要
BBoxDB Streams is a distributed stream processing system, which allows the handling of multi-dimensional data. Multi-dimensional streams consist of n-dimensional elements, such as position data (e.g., two-dimensional positions of cars or three-dimensional positions of aircraft). The software is an enhancement of BBoxDB, a distributed key-bounding-box-value store that allows the handling of n-dimensional big data. BBoxDB Streams supports continuous range queries and continuous spatial joins; n-dimensional point and non-point data are supported. Operations in BBoxDB Streams are performed primarily on the bounding boxes of the data. With user-defined filters (UDFs), custom data formats can be decoded, and the bounding box-based operations are refined (e.g., a UDF decodes and performs intersection tests on the real geometries of WKT encoded stream elements). A unique feature of BBoxDB Streams is the ability to perform continuous spatial joins between stream elements and previously stored multi-dimensional big data. For example, the dynamic position of a car can be efficiently joined with the static spatial data of a street network.
引用
收藏
页码:559 / 625
页数:66
相关论文
共 76 条
  • [1] Terry D(1992)Continuous queries over append-only databases SIGMOD Rec. 21 321-330
  • [2] Goldberg D(2020)BBoxDB: a distributed and highly available key-bounding-box-value store Distrib. Parallel Databases 38 439-493
  • [3] Nichols D(2012)CAP twelve years later: how the “rules” have changed IEEE Comput. 45 23-29
  • [4] Oki B(2010)Secondo: a platform for moving objects database research and for publishing and integrating research implementations IEEE Data Eng. Bull. 33 56-63
  • [5] Nidzwetzki JK(2010)Cassandra: a decentralized structured storage system SIGOPS Oper. Syst. Rev. 44 35-40
  • [6] Güting RH(2017)M-grid: a distributed framework for multidimensional indexing and querying of location based data Distrib. Parallel Databases 35 55-81
  • [7] Brewer E(2013)Scidb: a database management system for applications with complex analytics Comput. Sci. Eng. 15 54-62
  • [8] Güting RH(2015)Parallel SECONDO: scalable query processing in the cloud for non-standard applications SIGSPATIAL Spec. 6 3-10
  • [9] Behr T(2017)Distributed Secondo: an extensible and scalable database management system Distrib. Parallel Databases 35 197-248
  • [10] Düntgen C(2008)MapReduce: simplified data processing on large clusters Commun. ACM 51 107-113