Distributed Skyline Computation of Vertically Splitted Databases by Using MapReduce

被引:6
作者
Siddique, Md. Anisuzzaman [1 ]
Tian, Hao [1 ]
Morimoto, Yasuhiko [1 ]
机构
[1] Hiroshima Univ, Grad Sch Engn, Higashihiroshima 7398521, Japan
来源
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2014 | 2014年 / 8505卷
关键词
Skyline query; MapReduce; Privacy; Sensitive database;
D O I
10.1007/978-3-662-43984-5_3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Skyline query retrieve objects that are not dominated by another object. A result of a skyline query is relatively small, does not contain less important objects, and is useful for selecting an object. In this paper, we consider a method for computing skyline query in MapReduce framework, which is a de facto standard in big data analysis. Currently, we have to be aware of data disclosure. Therefore, we propose a distributed computation method, in which each computer uses only a projected database that is vertically splitted from an original database, for computing skyline query. Since one computer can see only projected values, sensitive information in a database can be localized in the proposed method in addition to the advantage of the efficiency of MapReduce. Extensive experiments demonstrate the efficiency of proposed algorithm for synthetic datasets.
引用
收藏
页码:33 / 45
页数:13
相关论文
共 21 条
  • [1] [Anonymous], 2010, P ACM SIGMOD INT C M, DOI DOI 10.1145/1807167.1807273
  • [2] Balke WT, 2004, LECT NOTES COMPUT SC, V2992, P256
  • [3] The Skyline operator
    Börzsönyi, S
    Kossmann, D
    Stocker, K
    [J]. 17TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2001, : 421 - 430
  • [4] Chan Chee-Yong., 2006, PROC ACM SPECIAL INT, P503
  • [5] Skyline with presorting
    Chomicki, J
    Godfrey, P
    Gryz, J
    Liang, DM
    [J]. 19TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2003, : 717 - 719
  • [6] Dellis E., 2007, Proceedings of the 33rd international conference on Very large data bases, P291
  • [7] MAP-JOIN-REDUCE: Toward Scalable and Efficient Data Analysis on Large Clusters
    Jiang, Dawei
    Tung, Anthony K. H.
    Chen, Gang
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (09) : 1299 - 1311
  • [8] Kian-Lee Tan, 2001, Proceedings of the 27th International Conference on Very Large Data Bases, P301
  • [9] Lappas T, 2010, LECT NOTES ARTIF INT, V6322, P195, DOI 10.1007/978-3-642-15883-4_13
  • [10] Lee J., 2010, ICDE, P1113