Parallelizing uncertain skyline computation against n-of-N data streaming model

被引:6
|
作者
Liu, Jun [1 ,2 ]
Li, Xiaoyong [1 ,2 ]
Ren, Kaijun [1 ,2 ]
Song, Junqiang [1 ,2 ]
机构
[1] Natl Univ Def Technol, Coll Meteorol & Oceanol, Changsha, Hunan, Peoples R China
[2] Natl Univ Def Technol, Coll Comp, Changsha, Hunan, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
data streams; n-of-N model; parallel queries; skyline queries; uncertain data; QUERIES; FRAMEWORK; MULTICORE;
D O I
10.1002/cpe.4848
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The skyline query over uncertain data streams, as an important aspect of big data analysis, plays a significant role in domains such as environment monitoring, decision-making, and data mining. The skyline query over uncertain data streams with sliding window model always focuses on the most recent N streaming items, which cannot meet the query requirements of different window scales at the same time. To improve the query flexibility and efficiency, we propose an efficient parallel method for processing uncertain n-of-N skyline queries; that is, computing the skyline for the most recent n (for all n <= N) items in parallel. Specifically, we first propose a framework for parallelizing the query computation for uncertain n-of-N skylines. Furthermore, we put forward a sliding window partitioning strategy as well as a streaming items mapping strategy to realize the load balance for each node. In addition, we define a spatial index structure RST based on R-tree to organize the elements within each individual sliding window and candidate set in each which can significantly improve the dominance tests. Most importantly, we provide an encoding interval scheme to transform the n-of-N query into stabbing query in each compute node, which can greatly minimize the query scope and improve the query efficiency. In addition, we use a red-black tree named RBI to store all stabbing intervals. Extensive experimental results demonstrate that the proposals are efficient and can greatly meet the query requirement of users in real applications.
引用
收藏
页数:20
相关论文
共 5 条
  • [1] Efficient probabilistic skyline computation against n-of-N data stream model
    Yang, Y.-T. (ytyang@nudt.edu.cn), 1600, Chinese Academy of Sciences (23): : 550 - 564
  • [2] Parallel n-of-N Skyline Queries over Uncertain Data Streams
    Liu, Jun
    Li, Xiaoyong
    Ren, Kaijun
    Song, Junqiang
    Zhang, Zongshuo
    DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA 2018), PT II, 2018, 11030 : 176 - 184
  • [3] Parallelizing skyline queries over uncertain data streams with sliding window partitioning and grid index
    Li, Xiaoyong
    Wang, Yijie
    Li, Xiaoling
    Wang, Yuan
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (02) : 277 - 309
  • [4] Parallelizing skyline queries over uncertain data streams with sliding window partitioning and grid index
    Xiaoyong Li
    Yijie Wang
    Xiaoling Li
    Yuan Wang
    Knowledge and Information Systems, 2014, 41 : 277 - 309
  • [5] Analyzing Uncertain Time Series Temperature Data for Forecasting and Streaming Using EDA-LSTM Model
    Raju, Gara Jaya
    Raju, G. Samuel Vara Prasada
    INTERNET TECHNOLOGY LETTERS, 2024,