The Performance Analysis of Distributed Storage Systems Used in Scalable Web Systems

被引:0
|
作者
Oles, Dominik [1 ]
Nowak, Ziemowit [2 ]
机构
[1] Tieto Czech Sro, 28 Rijna 3346-91, Ostrava 70200, Czech Republic
[2] Wroclaw Univ Sci & Technol, Fac Comp Sci & Management, Wybrzeze Wyspianskiego 27, PL-50370 Wroclaw, Poland
来源
INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, ISAT 2018, PT I | 2019年 / 852卷
关键词
Big Data; Hadoop; HBase; Kudu;
D O I
10.1007/978-3-319-99981-4_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scalable web systems are directly related to distributed storage systems used to process large amounts of data (big data). An example of such a system is Hadoop with its many extensions supporting data storage such as SQL-on-Hadoop systems and the "Parquet" file format. Another kind of systems for storing and processing big data are NoSQL databases, such as HBase, which are used in applications requiring fast random access. The Kudu system was created to combine the advantages of Hadoop and HBase and enable both effective data set analysis and fast random access. As subject of the research, performance analysis of the mentioned systems was performed. The experiment was conducted in the Amazon Web Services public cloud environment, where the cluster of nine virtual machines was configured. For research purpose, containing about billion rows fragment of "Wikipedia Page Traffic Statistics" public dataset was used. The results of the measurements confirm that the Kudu system is a promising alternative to the commonly used technologies.
引用
收藏
页码:287 / 298
页数:12
相关论文
共 50 条
  • [21] Statistical Analysis Methods for Interdependency Communication in Distributed Systems
    Badri, Sahar
    Fergus, Paul
    Hurst, William
    2016 9TH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE 2016), 2016, : 273 - 278
  • [22] An Approach to Improve Load Balancing in Distributed Storage Systems for NoSQL Databases: MongoDB
    Sudhakar
    Pandey, Shivendra Kumar
    PROGRESS IN COMPUTING, ANALYTICS AND NETWORKING, ICCAN 2017, 2018, 710 : 529 - 538
  • [23] COOPERATION OF SIMULATION AND DATA MODEL FOR PERFORMANCE ANALYSIS OF COMPLEX SYSTEMS
    Kim, B. S.
    Kim, T. G.
    INTERNATIONAL JOURNAL OF SIMULATION MODELLING, 2019, 18 (04) : 608 - 619
  • [24] Performance Comparison on the Heterogeneous File System in Cloud Storage Systems
    Chen, Wei-Peng
    Liu, Chuan-Ming
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2016, : 694 - 701
  • [25] Optimized distributed systems achieve significant performance improvement on sorted merging of massive VCF files
    Sun, Xiaobo
    Gao, Jingjing
    Jin, Peng
    Eng, Celeste
    Burchard, Esteban G.
    Beaty, Terri H.
    Ruczinski, Ingo
    Mathias, Rasika A.
    Barnes, Kathleen
    Wang, Fusheng
    Qin, Zhaohui S.
    GIGASCIENCE, 2018, 7 (06):
  • [26] HPS-HDS: High Performance Scheduling for Heterogeneous Distributed Systems
    Pop, Florin
    Iosup, Alexandru
    Prodan, Radu
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 78 : 242 - 244
  • [27] Dynamic erasure coding decision for modern block-oriented distributed storage systems
    Ahn, Hoo-Young
    Lee, Kyong-Ha
    Lee, Yoon-Joon
    JOURNAL OF SUPERCOMPUTING, 2016, 72 (04): : 1312 - 1341
  • [28] Dynamic erasure coding decision for modern block-oriented distributed storage systems
    Hoo-Young Ahn
    Kyong-Ha Lee
    Yoon-Joon Lee
    The Journal of Supercomputing, 2016, 72 : 1312 - 1341
  • [29] A distributed data storage and processing framework for next-generation residential distribution systems
    Zhang, Ni
    Yan, Yu
    Xu, Shengyao
    Su, Wencong
    ELECTRIC POWER SYSTEMS RESEARCH, 2014, 116 : 174 - 181
  • [30] Large-Scale Data Storage and Management Scheme Based on Distributed Database Systems
    Sun, Qiao
    Deng, Bu-qiao
    Fu, Lan-mei
    Wang, Zhi-qiang
    Pei, Xu-bin
    Sun, Jia-Song
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND INTELLIGENT MANUFACTURING (ITIM 2017), 2017, 142 : 14 - 17