IMPLEMENTING JOINS OVER HBASE ON CLOUD PLATFORM

Cited by: 6
Authors
Gadkari, Ajinkya [1]
Nikam, V. B. [1]
Meshram, B. B. [1]
Affiliations
[1] VJTI, Dept Comp Engn & Informat Technol, Mumbai, Maharashtra, India
Source
2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT) | 2014
Keywords
Query Language; HBase; Cloud Databases; NoSQL; Join; Complex Queries; MapReduce Join
DOI
10.1109/CIT.2014.77
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The amount of data and the number of database accesses have increased enormously, and traditional databases are unable to meet these requirements. Along with this growth, data is becoming more unstructured. Relational databases not only fail to serve these purposes efficiently but also impose a limit on the size of data; emerging cloud databases overcome these limitations. A cloud database can handle very large amounts of data and a large number of database accesses, and it supports semi-structured and unstructured data alongside structured data. HBase is an open-source, non-relational, distributed cloud database. HBase does not support SQL queries; it provides its own APIs to access data. Moreover, HBase does not support joins or nested queries. Joins were dropped from NoSQL databases because they add processing overhead, which becomes significant for very large amounts of data. A join, however, is the way to combine results from one or more tables with very little code. In this paper we propose a layer over HBase that supports joins. This layer sits between the user and HBase and uses the HBase APIs to access the data, which is stored in the underlying HDFS. Developers will be able to use this layer as an API by simply including the layer's libraries in their program.
Pages: 547-554
Page count: 8
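
The abstract does not say how the proposed layer computes a join, so the following is only an illustrative sketch of one common approach, an in-memory hash join performed on the client side. It is not the authors' implementation. The class `JoinSketch`, the method `hashJoin`, and the row model (a map from column name to value) are hypothetical; in a real layer the two row lists would presumably be filled from HBase scans, which are omitted here so that the example runs without a cluster.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class JoinSketch {

    // Hypothetical row model: column name -> value, standing in for the
    // cells that a scan of one HBase row would return.
    public static List<Map<String, String>> hashJoin(
            List<Map<String, String>> left, String leftKey,
            List<Map<String, String>> right, String rightKey) {

        // Build phase: index the right-hand rows by their join-column value.
        Map<String, List<Map<String, String>>> index = new HashMap<>();
        for (Map<String, String> row : right) {
            String key = row.get(rightKey);
            if (key != null) {
                index.computeIfAbsent(key, k -> new ArrayList<>()).add(row);
            }
        }

        // Probe phase: look up each left-hand row and emit one merged row
        // per match (inner-join semantics; unmatched rows are dropped).
        List<Map<String, String>> result = new ArrayList<>();
        for (Map<String, String> l : left) {
            String key = l.get(leftKey);
            if (key == null) {
                continue;
            }
            for (Map<String, String> r : index.getOrDefault(key, List.of())) {
                Map<String, String> merged = new HashMap<>(l);
                merged.putAll(r); // right-hand value wins if column names clash
                result.add(merged);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Made-up sample data standing in for two HBase tables.
        List<Map<String, String>> users = List.of(
                Map.of("user_id", "u1", "name", "Asha"),
                Map.of("user_id", "u2", "name", "Ravi"));
        List<Map<String, String>> orders = List.of(
                Map.of("order_id", "o1", "buyer", "u1"),
                Map.of("order_id", "o2", "buyer", "u1"),
                Map.of("order_id", "o3", "buyer", "u9"));

        for (Map<String, String> row : hashJoin(users, "user_id", orders, "buyer")) {
            System.out.println(row.get("name") + " " + row.get("order_id"));
        }
    }
}
```

A hash join of this kind holds one side entirely in memory, so it only suits the case where at least one table is small. The MapReduce Join keyword suggests that larger inputs would be handled in a distributed fashion instead.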