Development of Multiple Big Data Analytics Platforms with Rapid Response

被引:2
作者
Chang, Bao Rong [1 ]
Lee, Yun-Da [1 ]
Liao, Po-Hao [1 ]
机构
[1] Natl Univ Kaohsiung, Dept Comp Sci & Informat Engn, 700 Kaohsiung Univ Rd, Kaohsiung 811, Taiwan
关键词
BUSINESS INTELLIGENCE;
D O I
10.1155/2017/6972461
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The crucial problem of the integration of multiple platforms is how to adapt for their own computing features so as to execute the assignments most efficiently and gain the best outcome. This paper introduced the new approaches to big data platform, RHhadoop and SparkR, and integrated them to form a high-performance big data analytics with multiple platforms as part of business intelligence (BI) to carry out rapid data retrieval and analytics with R programming. This paper aims to develop the optimization for job scheduling using MSHEFT algorithm and implement the optimized platform selection based on computing features for improving the system throughput significantly. In addition, users would simply give R commands rather than run Java or Scala programto perform the data retrieval and analytics in the proposed platforms. As a result, according to performance index calculated for various methods, although the optimized platform selection can reduce the execution time for the data retrieval and analytics significantly, furthermore scheduling optimization definitely increases the system efficiency a lot.
引用
收藏
页数:13
相关论文
共 25 条
  • [11] Integration and optimization of multiple big data processing platforms
    Chang, Bao-Rong
    Tsai, Hsiu-Fen
    Tsai, Yun-Che
    Kuo, Chin-Fu
    Chen, Chi-Chung
    [J]. ENGINEERING COMPUTATIONS, 2016, 33 (06) : 1680 - 1704
  • [12] An Overview of Business Intelligence Technology
    Chaudhuri, Surajit
    Dayal, Umeshwar
    Narasayya, Vivek
    [J]. COMMUNICATIONS OF THE ACM, 2011, 54 (08) : 88 - 98
  • [13] Chen HC, 2012, MIS QUART, V36, P1165
  • [14] George L., 2011, Hbase: The definitive guide
  • [15] Heinrich J., 2015, P BIG DAT VIS AN BDV
  • [16] Hoffman Steve., 2013, Apache Flume: Distributed Log Collection for Hadoop
  • [17] Karun AK, 2013, 2013 IEEE CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES (ICT 2013), P132
  • [18] Li G., 2017, P SPARK SUMM 2013 SA
  • [19] Maurya M, 2012, PROCEEDINGS OF THE 2012 WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES, P505, DOI 10.1109/WICT.2012.6409130
  • [20] Web semantics in the clouds
    Mika, Peter
    Tummarello, Giovanni
    [J]. IEEE INTELLIGENT SYSTEMS, 2008, 23 (05) : 82 - 87