Efficient subspace skyline query based on user preference using MapReduce

被引:17
作者
Li, Yuanyuan [1 ,2 ]
Li, Zhiyang [1 ]
Dong, Mianxiong [3 ]
Qu, Wenyu [1 ]
Ji, Changqing [4 ]
Wu, Junfeng [1 ,5 ]
机构
[1] Dalian Maritime Univ, Sch Informat Sci & Technol, Dalian, Peoples R China
[2] Dalian Jiaotong Univ, Sch Software, Dalian, Peoples R China
[3] Muroran Inst Technol, Dept Informat & Elect Engn, Muroran, Hokkaido, Japan
[4] Dalian Univ, Coll Phys Sci & Technol, Dalian 116012, Peoples R China
[5] Dalian Ocean Univ, Sch Informat Engn, Dalian, Peoples R China
关键词
Subspace skyline query; MapReduce; Pruning strategy; Grid; User preference;
D O I
10.1016/j.adhoc.2015.07.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Subspace skyline, as an important variant of skyline, has been widely applied for multiple-criteria decisions, business planning. With the development of mobile internet, subspace skyline query in mobile distributed environments has recently attracted considerable attention. However, efficiently obtaining the meaningful subset of skyline points in any subspace remains a challenging task in the current mobile interne. For more and more mobile applications, subspace skyline query on mobile units is usually limited by big data and wireless bandwidth. To address this issue, in this paper, we propose a system model that can support subspace skyline query in mobile distributed environment. An efficient algorithm for processing the Subspace Skyline Query using MapReduce (SSQ) is also presented which can obtain the meaningful subset of points from the full set of skyline points in any subspace. The SSQ algorithm divides a subspace skyline query into two processing phases: the preprocess phase and the query phase. The preprocess phase includes the pruning process and constructing index process which is designed to reduce network delay and response time. Additionally, the query phase provides two filtering methods, SQM-filtering and epsilon-filtering, to filter the skyline points according to user preference and reduce network cost. Extensive experiments on real and synthetic data are conducted and the experimental results indicate that our algorithm is much efficient, meanwhile, the pruning strategy can further improve the efficiency of the algorithm. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:105 / 115
页数:11
相关论文
共 27 条
  • [1] Balke W.-T., 2004, P INT C EXT DAT TECH, P256
  • [2] The Skyline operator
    Börzsönyi, S
    Kossmann, D
    Stocker, K
    [J]. 17TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2001, : 421 - 430
  • [3] Chan Chee-Yong., 2006, PROC ACM SPECIAL INT, P503
  • [4] MapReduce Skyline Query Processing with A New Angular Partitioning Approach
    Chen, Liang
    Hwang, Kai
    Wu, Jian
    [J]. 2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 2262 - 2270
  • [5] Chomicki J., 2002, P IEEE ICDE, P717
  • [6] Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
  • [7] ComMapReduce: An improvement of MapReduce with lightweight communication mechanisms
    Ding, Linlin
    Wang, Guoren
    Xin, Junchang
    Wang, Xiaoyang
    Huang, Shan
    Zhang, Rui
    [J]. DATA & KNOWLEDGE ENGINEERING, 2013, 88 : 224 - 247
  • [8] Efficient Skyline Computation on Big Data
    Han, Xixian
    Li, Jianzhong
    Yang, Donghua
    Wang, Jinbao
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (11) : 2521 - 2535
  • [9] A survey of skyline processing in highly distributed environments
    Hose, Katja
    Vlachou, Akrivi
    [J]. VLDB JOURNAL, 2012, 21 (03) : 359 - 384
  • [10] Scalable nearest neighbor query processing based on Inverted Grid Index
    Ji, Changqing
    Li, Zhiyang
    Qu, Wenyu
    Xu, Yujie
    Li, Yuanyuan
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2014, 44 : 172 - 182