Efficient Probabilistic Skyline Query Processing in MapReduce

被引:6
作者
Ding, Linlin [1 ]
Wang, Guoren [1 ]
Xin, Junchang [1 ]
Yuan, Ye [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Peoples R China
来源
2013 IEEE INTERNATIONAL CONGRESS ON BIG DATA | 2013年
关键词
probabilistic skyline; MapReduce; uncertain data; RANKING;
D O I
10.1109/BigData.Congress.2013.35
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As a popular parallel programming model, how to process probabilistic skyline query over uncertain data in MapReduce framework is becoming an urgent problem to be resolved. In MapReduce framework, implementing probabilistic skyline query is nontrivial since the probabilistic skyline query is not decomposable. Therefore, in this paper, we propose a filter-refine two phases approach in MapReduce that translates the probabilistic skyline query into two decomposable computations for obtaining the final results. Firstly, we describe the whole processing procedure of filter-refine, and then propose an efficient probabilistic skyline query processing algorithm in MapReduce. Furthermore, to reduce the computation and communication cost, we develop the optimized probabilistic skyline query processing algorithm to prune the unpromising data both in filter and refine phases. Finally, we conduct extensive experiments on synthetic data to verify the effectiveness and efficiency of the proposed
引用
收藏
页码:203 / 210
页数:8
相关论文
共 16 条
[1]  
[Anonymous], 2006, IEEE Date Eng. Bull.
[2]  
[Anonymous], 2009, Proceedings of the VLDB Endowment
[3]   Computing All Skyline Probabilities for Uncertain Data [J].
Atallah, Mikhail J. ;
Qi, Yinian .
PODS'09: PROCEEDINGS OF THE TWENTY-EIGHTH ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2009, :279-287
[4]  
Bu YY, 2010, PROC VLDB ENDOW, V3, P285
[5]   Approximate aggregation techniques for sensor databases [J].
Considine, J ;
Li, FF ;
Kollios, G ;
Byers, J .
20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, :449-460
[6]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[7]   Efficient and Progressive Algorithms for Distributed Skyline Queries over Uncertain Data [J].
Ding, Xiaofeng ;
Jin, Hai .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (08) :1448-1462
[8]  
Junchang Xin, 2011, 2011 Seventh International Conference on Natural Computation (ICNC 2011), P311, DOI 10.1109/ICNC.2011.6021918
[9]  
Li FF, 2009, ACM SIGMOD/PODS 2009 CONFERENCE, P361
[10]  
Lian Xiang., 2008, P ACM SIGMOD INT C M, P213