An Optimized Distributed OLAP System for Big Data

被引:0
作者
Chen, Wenhao [1 ]
Wang, Haoxiang [1 ]
Zhang, Xingming [1 ]
Lin, Qidi [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China
来源
2017 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (ICCIA) | 2017年
关键词
big data; decision making; OLAP;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To solve the problems of heterogeneous data types and large amount of calculation in making decision for big data, an optimized distributed OLAP system for big data is proposed in this paper. The system provides data acquisition for different data sources, and supports two types of OLAP engines, Impala and Kylin. First of all, the architecture of the system is proposed, consisting of four modules, data acquisition, data storage, OLAP analysis and data visualization, and the specific implementation of each module is descripted in great detail. Then the optimization of the system is put forward, which is automatic metadata configuration and the cache for OLAP query. Finally, the performance test of the system is conduct to demonstrate that the efficiency of the system is significantly better than the traditional solution.
引用
收藏
页码:36 / 40
页数:5
相关论文
共 16 条
  • [11] Li Y., 2017, APACHE KYLIN EBAY EX
  • [12] LiJun Wang, 2011, 2011 2nd International Conference on Artificial Intelligence, Management Science and Electronic Commerce (AIMSEC 2011), P2935, DOI 10.1109/AIMSEC.2011.6010846
  • [13] Metz C, 2012, GOOGLES DREMEL MAKES
  • [14] Meyer M., 2006, Proceedings of the 2006 ACM symposium on Software visualization, P135
  • [15] What-If Query Processing Policy for Big Data in OLAP System
    Xu, Huan
    Luo, Hao
    He, Jieyue
    [J]. 2013 INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2013, : 110 - 116
  • [16] Yu B, 2017, IEEE T CLOUD COMPUT, P1