A Cloud-based Efficient On-line Analytical Processing System with Inverted Data Model

被引:2
作者
Huang, Sheng-Wei [1 ]
Shieh, Ce-Kuen [1 ]
Liao, Che-Ching [1 ]
Chiu, Chui-Ming [1 ]
Tsai, Ming-Fong [2 ]
Chen, Lien-Wu [2 ]
机构
[1] Natl Cheng Kung Univ, Dept Elect Engn, Inst Comp & Commun Engn, Tainan, Taiwan
[2] Feng Chia Univ, Dept Informat Engn & Comp Sci, Taichung, Taiwan
来源
PROCEEDINGS OF THE 11TH EAI INTERNATIONAL CONFERENCE ON HETEROGENEOUS NETWORKING FOR QUALITY, RELIABILITY, SECURITY AND ROBUSTNESS | 2015年
关键词
D O I
10.4108/eai.19-8-2015.2261409
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
On-line analytical processing (OLAP) provides analysis of multi-dimensional data stored in a database and achieves great success in many applications such as sales, marketing,. financial data analysis. OLAP operation is a dominant part of data analysis especially when addressing a large amount of data. With the emergence of the MapReduce paradigm and cloud technology, OLAP operation can be processed on big data that resides in scalable, distributed storage. However, current MapReduce implementations of OLAP operation processing have a major performance drawback caused by improper processing procedure. This is crucial when dimension or dependent attributes are large, which is a common case for most data warehouses hold nowadays. To solve this issue, this paper proposes a methodology to accelerate the performance of OLAP operation processing on big data. We have conducted the experiments on the basic algebra of OLAP operation with different data sizes to demonstrate the effectiveness of our system.
引用
收藏
页码:341 / 345
页数:5
相关论文
共 14 条
  • [1] Borthakur D., HDFS ARCHITECTURE GU
  • [2] Bigtable: A distributed storage system for structured data
    Chang, Fay
    Dean, Jeffrey
    Ghemawat, Sanjay
    Hsieh, Wilson C.
    Wallach, Deborah A.
    Burrows, Mike
    Chandra, Tushar
    Fikes, Andrew
    Gruber, Robert E.
    [J]. ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2008, 26 (02):
  • [3] Chaudhuri S., 1997, SIGMOD Record, V26, P65, DOI 10.1145/248603.248616
  • [4] A COMPUTING PROCEDURE FOR QUANTIFICATION THEORY
    DAVIS, M
    PUTNAM, H
    [J]. JOURNAL OF THE ACM, 1960, 7 (03) : 201 - 215
  • [5] Mapreduce: Simplified data processing on large clusters
    Dean, Jeffrey
    Ghemawat, Sanjay
    [J]. COMMUNICATIONS OF THE ACM, 2008, 51 (01) : 107 - 113
  • [6] Elsmari R., 2010, FUNDAMENTALS DATABAS
  • [7] Giovinazzo W.A., 2000, OBJECT ORIENTED DATA
  • [8] Han J., 2006, Data Mining, Southeast Asia Edition: Concepts and Techniques
  • [9] Inmon W. H., 2005, ''Building Data Warehouse
  • [10] Inmon William H., 2005, P DAT ENG ICDE 2010